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The concept of time emerges as an ordering structure in a classical statistical ensemble. Prob- 
ability distributions Prit) at a given time t obtain by integrating out the past and future. We 
discuss all-time probability distributions th at rea lize a unitary time evolution as described by rota- 
tions of the real wave function qTit) = ±y/pA¥). We establish a map to quantum physics and the 
Schrodinger equation. Suitable classical observables are mapped to quantum operators. The non- 
commutativity of the operator product is traced back to the incomplete statistics of the local-time 
subsystem. Our investigation of classical statistics is based on two-level observables that take the 
values one or zero. Then the wave functions can be mapped to elements of a Grassmann algebra. 
Quantum field theories for fermions arise naturally from our formulation of probabilistic time. 
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What is time? This question touches a basic concept in 
physics. In many theoretical formulations, such as mechan- 
ics or quantum field theory, time appears as an "a priori 
I— notion". Time looses its absolute character by the unifi- 
cation of space and time in special and general relativity. 
I Still, spacetime is assumed to exist a priori in these the- 
^^ries, although time intervals and distances depend now 
f-H on the particular inertial system and metric and therefore 
I— iljecome dynamical rather than a priori properties. We pur- 
sue in this paper the notion that time (or more generally 
spacetime) arises as an effective concept in a theoretical 
setting which does not postulate its existence from the be- 
ginning. More precisely, we explore the possibility that 
ly-jtime emerges as an ordering structure in a probabilistic 
fSl setting based on a classical statistical ensemble. As such, it 
^vgip not distinguished from other possible structures among 
^^1?he observables and appears as a derived quantity. Not 
all possible probability distributions for the states of the 
T-H ensemble are compatible with the concept of time. Only 
A subclass of the probability distributions allows the exis- 
tence of time, whereas for others "there is no time" . Such a 
setting opens the logical possibility that time could appear 
as an approximate concept. It provides a formal framework 
for discussions of the idea that the notion of time may loose 
its validity under extreme circumstances as the "beginning 
of the universe." 

Many thoughts have been given in the past to the idea of 
an "emergence of time" from a more fundamental setting 
[ll-[T]| . Our approach is close to general statistics [l| , where 
the emergence of space and geometry has been discussed 
previously 0. Since the basic concept of a statistical en- 
semble contains no notion of a different signature of the 
metric for space and time, this crucial difference between 
space and time has to be traced back to a difference in the 
relevant structures among observables. We will see that 
the particularity of time is associated to the presence of a 
"unitary evolution law" which singles out time as compared 
to space. 

The starting point of our approach is the description 
of Nature in terms of a classical statistical ensemble with 



X 
S3 



positive probabilities. Observables have fixed values in the 
"classical states" of the ensemble. The association of the 
abstract observables with real observations becomes only 
possible once appropriate general structures among observ- 
ables are identified - those structures include time and 
space. We will find that quantum physics and quantum 
field theory emerge naturally in our approach. Once these 
structures are identified we can proceed to an association 
between quantum operators and observations. In this pa- 
per we remain on a rather abstract level. Many different 
associations of physical observations which observables in a 
classical ensemble describing, for example, two-state quan- 
tum mechanics can be made. A special role is played by 
the Hamiltonian operator which is directly related to the 
concept of time-evolution. 

While quantum physics is always related to a probabilis- 
tic description of Nature, we find in our approach that all 
quantum features can be described only in terms of posi- 
tive "classical" probabilities. In the usual view the quan- 
tum wave function ij: carries more information than the 
probabilities which are associated to jV'p. This additional 
information is stored in the phase of the wave function. It is 
therefore widely believed that quantum physics cannot be 
expressed in terms of positive probabilities alone. In con- 
trast, our investigation reveals that all quantum features 
can indeed be described in terms of classical probabilities. 
In our context this is closely related to the emergence of 
time. The quantum features of a wave function and the 
non-commuting operators at a given time arise from a clas- 
sical statistical ensemble for all times. At a given time the 
statistical description is incomplete since the statistical in- 
formation concerning the past and future is "integrated 
out". 

We work within a general probabilistic setting where one 
assumes the existence of one reality, while physical laws 
allow us only to compute probabilities. (Deterministic laws 
are special cases for probabilities extremely close to one or 
zero.) Such a setting may be considered as fundamental 
(probabilistic realism). However, our discussion also covers 
macroscopic statistical systems where an effective concept 
of evolution and time is not necessarily directly related to 
fundamental time. 

As a simple example we may consider an Ising-type 
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model on a two-dimensional square lattice with discrete 
lattice points labeled by x'~' and x^. For each lattice site 
one has an Ising spin that can only take two values, or 
an associated occupation number n(x^,x^) that takes the 
values zero or one. The possible association with bits is 
obvious. The states u are configurations of Ising spins or 
ordered sequences of bits. We consider a classical ensem- 
ble with one given probability distribution {p^j}- More 
formally, we associate to each state uj a positive number 
< Pa; < 1, with normalization J^ljP'^ ~ ^■ 

We ask the question for which type of {pu:} one of the 
lattice directions, say the direction of cc", will appear as a 
time direction, and the other, say the direction of 
a space direction. The structures of time and space can 
be linked to the behavior of correlation functions ^. We 
may consider connected correlation functions in different 
directions, as 

Go{d;x) = {n{x° + d,x'^)n{x°,x^))c, 

Gi{d;x) = {n{x°,x^ +d)n{x°,x^))c. (1) 

Time structures are often periodic, resulting in a periodic 
correlation function Go{ndo) = Go(0), for a suitable do and 
integer n, while space structures are typically reflected by 
an exponential or power law decay of Gi{d) for large \d\. 
We want to formalize these different types of structures and 
find out what type of probability distributions {puj} realize 
a time structure. Time will be associated with appropriate 
properties of an "evolution law" that allows the compu- 
tation of effective probabilities {pr{x'^)} for some "time" 
t — x^, and relates {pt{x° + e)} to {pt{x^)}- 

The statistical ensemble described by the probability 
distribution {puj} may be called the "all-time ensemble". 
It permits to compute expectation values and correlations 
for occupation numbers at arbitrary times. Knowledge of 
{Puj} would contain the information about measurements 
or events at all times - present, past and future. In con- 
trast, a local (in time) probability distribution {prit)} con- 
tains the information about correlation functions at a given 
time t. A sequence of local probability distributions {prit)} 
for all t may be associated with the state of a "local-time 
subsystem". The sequence of {prit)} contains much less 
information than {puj}- The transition from the all-time 
ensemble to a state of the local-time subsystem therefore 
involves an important "coarse graining of the information" . 
This coarse graining leads to incomplete statistics , [13 
and is responsible for many of the characteristic conceptual 
features of quantum mechanics. 

The "local-time probability distribution" {prit)} and its 
time evolution shows analogies to the transfer matrix for- 
mulation for the Ising model. However, in the conventional 
two-dimensional Ising model both x'^ and x^ are equal foot- 
ing. The correlation functions typically decay for large \d\ 
such that we may associate both and x^ with space di- 
rections. In the present paper we investigate generalized 
Ising- type models where the all-time ensemble {puj} differs 
from the standard Ising model. We concentrate on all-time 
ensembles that lead to an evolution law which allows for 
periodic structures, such that x° can be associated with a 



time direction rather than a space direction. Perhaps sur- 
prisingly, a large class of suitable {pu:} is found to describe 
a quantum field theory for fermions in one time and one 
space dimension. 

The Ising model is just one particular example and we 
will keep our discussion of time structures more general. 
It is rather straightforward to implement in a statistical 
ensemble the structure of an ordered chain. It is sufficient 
that a class of observables exists such that all observables 
in this class depend on some label ti, and that every ti has 
only two neighboring labels ti-i and t^+i. This allows us 
to order the observables by an increasing label ti of their 
"time argument" . (The direction of time" plays no role at 
this level. There may also exist other observables that have 
no "time label" ti.) We will consider here the basic setting 
of two- level observables Na{ti) that can only take values 
one or zero, where the index a is a collective index for 
other properties as location in space or quantum numbers 
of particles. More complicated observables can always be 
reduced to a sufficient number of such yes/no decisions. 
For the Ising model discussed above ti can be associated 
to values of the lattice coordinate x", while a corresponds 
to values of the lattice coordinate x^. 

We assume that the observables Na{ti) take definite val- 
ues for every state of the statistical ensemble. Thus every 
state can be labeled by all the values na{ti) = 0, 1 that the 
observables Na{ti) take in this state. We also assume that 
the sequence of numbers {ria(ii)} sufficient to label the 
states of the classical statistical ensemble, which we will 
denote by = {na{ti)}. Furthermore, we assume that the 
range of a is the same for all ti . Without loss of general- 
ity we can use labels ti = ei, i € Z, such that the chain 
consists of equivalent points in the interval tin < ti < tf. 
Taking the limit e — > at fixed tin , t / results in observables 
depending on a continuous time-coordinate t. (We could 
have started our discussion with a continuous time label 
from the beginning. However, the limiting procedure of a 
discrete setting allows for more conceptual clarity since all 
quantities are well defined for arbitrarily small but nonzero 

At this stage nothing distinguishes time from similar 
"one- dimensional continuous ordering structures" where 
observables depend on some real parameter, as a space 
coordinate, momentum or energy. For all one-dimensional 
ordering structures one can define the notion of "local prob- 
abilities" Prit). (In a more general context, the label t 
is then replaced by a space coordinate x or an energy 
E.) While the general probability distribution associates 
to each state w = {na{ti)^ a positive real number p^, 
with^^Pi^ = 1, the local probabilities depend only on the 
sequence of "local occupation numbers" r = [^^^(i)] at a 
given "time" t. the local probability Prit) obtains by sum- 
ming for given values of r over all probabilities Pi^ for states 
that share the same value of r. In other words, we consider 
in the generalized parameter space spanned by (a, t) a hy- 
persurface at fixed i, and we "integrate out" all information 
concerning the "left and right" or the "past and future" of 
this hypersurface. By construction, the local probabilities 
are normalized for every t, TtrPrit) — 1. 
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A central property of statistical ensembles which admit a 
time structure is the existence of an "evolution law" which 
allows the computation of if Prit — e) is known. (In- 
stead of the "immediate past" at t — e we can also gener- 
alize to a larger range of "past probabilities".) The evolu- 
tion law is the basis for differential "evolution equations" 
for the expectation values of observables as, for example, 
the Schrodinger or Heisenberg equations in quantum me- 
chanics. Indeed, one can define "local observables" which 
involve combinations of Na{t) at a given time t. Their 
expectation values can be computed from Prit) without 
additional information from the past and future. An evo- 
lution law allows predictions for future expectation values 
if the present ones are known. There exists a variety of 
statistical systems which admit an evolution law - for lat- 
tice systems this issue is related to the concept of transfer 
matrices. At this stage we still have not yet addressed the 
particular properties of "time" as compared to some space 
direction. 

The difference between time and space resides in the par- 
ticular properties of the evolution law. (More precisely, the 
properties of the evolution law are responsible for a differ- 
ent signature of the metric for time and space.) This pa- 
per therefore addresses a particular form of the evolution 
law that we call "unitary evolution law". A glance to the 
characteristics of time evolution in nature reveals the om- 
nipresence of oscillation phenomena, as in quantum physics 
or clocks. The evolution law should therefore be compati- 
ble with a periodic time evolution of the local probabilities 
Prit)- This eliminates evolution laws which lead to a mono- 
tonic behavior of Pr(i) as, for example, the approach to a 
fixed point. 

For a realization of unitary evolution laws we introduce 
the concept of a real "wave function" qrit) in classical 
statistics. It equals the square root of Pr{t) up to a sign, 
Prit) — Qrit)- Since Qrit) is a vector with unit length, the 
periodicity in the time evolution can be accounted for nat- 
urally by rotations of this vector. A unitary time evolution 
acts linearly on qrit), i.e. 

qr{t)^J2^-p'ip^*-^^^ (2) 
p 

with rotation matrix R independent of q. This is the sim- 
plest evolution law which is compatible with periodic Prit), 
as compared to the more general non-linear case where R 
depends on q or to evolution laws involving information be- 
yond the immediate past t — e. We will see that this simple 
assumption leads directly to the quantum formalism and 
the Schrodinger equation. Quantum physics arises natu- 
rally from the formulation of an evolution law for "prob- 
abilistic time". We find here an explicit example for the 
emergence of quantum physics from a classical statistical 
ensemble 13]. 

One may ask for which overall probability distributions 
Plo = p{{na{ti)}) the unitary evolution law ^ holds. We 
show that for every given sequence of local probabilities 
{Prit)} a large equivalence class of different {pcj} exists 
which yields this sequence by integrating out the past and 
future. The question if some particular all-time ensemble 



{Pu:} is particularly natural is left to future investigations. 
This issue is not crucial for the investigations of the present 
paper. 

We use the analogy of the sequence {na{ti)} with occu- 
pation numbers for fermions in order to construct a rep- 
resentation of the probability density {prit)} in terms of 
Grassmann variables. If the index a contains a space co- 
ordinate and "internal quantum numbers" , the Grassmann 
algebra describes a quantum field theory for fermions. It 
is remarkable how many basic concepts of fundamental 
physics - quantum theory, Schrodinger equation, fermions, 
Grassmann variables and quantum field theories - arise in a 
natural way from the attempt to formulate a probabilistic 
concept of time. 

This paper is organized as follows. In sect. |IT]we dis- 
cuss the all-time ensemble and introduce as basic building 
blocks the two- level observables Na{ti). They correspond 
to yes/no alternatives, carrying one bit of information, or 
to occupation numbers with values 0, 1. Observables of 
this type are defined at every point ti of an ordered chain 
and we characterize the states uj of the classical statisti- 
cal ensemble by the values of these observables na{ti), i.e. 
cj = {nQ,(ti)}. The ensemble is specified by the probability 
Puj for all states lo. This minimal setting will be our only 
basic conceptual assumption. All correlation functions for 
the observables Na{ti) are well defined by this setting. The 
concept of time emerges if the probability distribution {puj} 
has the characteristic properties mentioned above. 

We define in sect. IIIII the "local-time probability distri- 
bution" Prit). It is local in time and obtains by "integrat- 
ing out" the past and the future, i.e. summing over all 
t' t. The sequence of {prit)} for different t specifies a 
state of the local-time subsystem. We also discuss simple 
examples for all-time probability distributions {puj} that 
lead to periodic local probabilities. One of these examples 
will later turn out to describe two-state quantum mechan- 
ics. We show that the map from the all-time ensemble to 
the local-time subsystem introduces equivalence classes of 
probability distributions. Two all-time probability distri- 
butions are equivalent if they lead to the same state of the 
local-time subsystem. We argue that only the information 
contained in the local-time subsystem is available for a de- 
scription of Nature. 

In sect. IIVI we formulate an essential property of the 
probability distribution {puj} which allows for the struc- 
tures of time and evolution: it is the existence of an evolu- 
tion law which relates pr (t) to Pr (t — e) for two neighboring 
"time points" . In particular, we discuss the "unitary time 
evolution" which amounts to a rotation of the vector qrit), 
where Pr — q^- Sect. IVl establishes the close analogy to 
the time evolution in quantum mechanics. We construct an 
explicit map from the local-time subsystem to a quantum 
system. 

In sect. IVII we begin our discussion of the connection 
between a classical statistical ensemble for the two-level 
observables and fermionic Grassmann variables by a map 
between the local probability distributions and elements 
of a Grassmann algebra, and an associated map between 
two-level observables and Grassmann operators. Sect. I VIII 
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interprets the wave function q-r as the wave function for a 
pure quantum state of a multi-fermion system. We recover 
the quantum formahsm with a Schrodinger equation for 
the wave function. 

As a simple example, we describe in sect. IVIIII a classi- 
cal two state system with a unitary evolution law for the 
classical probabilities. This time evolution is the same as 
for two state quantum mechanics for a spin in a constant 
magnetic field. We explicitly display the Grassmann rep- 
resentation for this system. In sect. IIXI we discuss within 
this example the issue of unequal time correlation functions 
and derivative observables as dtN. We find that dtN{t) 
can be expressed in terms of the local probability distri- 
bution, but it involves an operator that does not commute 
with N{t). Similarly, the expectation values of observables 
Na(t -f A) can be computed from qrit). They are repre- 
sented by off-diagonal operators. We clarify how the emer- 
gence of non-commuting operators is related to the coarse 
graining of the information in the step from the all-time 
ensemble to the local-time subsystem. In sect. |X]we ad- 
dress the connection between correlations of measurements 
and the non-commutative operator product. 

Sect. IXII discusses in more detail the wave function which 
is used to describe the unitary evolution. A complex struc- 
ture allows to map the real wave function Qt (t) in eq. ^ to 
the usual complex wave functions in quantum mechanics. 
We give an example for a classical statistical ensemble lead- 
ing to a complex wave function in sect. IXIII It can describe 
a complex two-component spinor. Sect. IXIIII extends our 
probabilistic description of time to non-relativistic quan- 
tum and classical particles. In sect. IXIVI we address quan- 
tum field theories for fermions. We draw our conclusions 
in sect. |XVl 



II. ALL-TIME ENSEMBLE 

We consider a classical statistical ensemble of a general- 
ized Ising type with R bits. The states lo of this ensemble 
are bit sequences. At this stage no notion of time is in- 
troduced yet. We assume R = TB and label the bits by 
(n, a), n — 1 . . .T, a = 1 . . . B. There are obviously many 
ordering structures of this type. The particular one that 
corresponds to time will be singled out by the existence of 
an evolution law that we discuss in sect. IIVI For the time 
being we simply assume that the appropriate selection is 
made and replace the label n by a discrete time label t. 

For a more general discussion of our setting we assume 
an ordered chain of points that we place at coordinates ti 
at equal distance, ti+i—ti = e. At each point we consider a 
set of numbers na{ti) which can take the values one or zero. 
This may be interpreted as a property a realized at point 
ti {na{ti) — l) or not [na{ti) = O). A possible state uj 
of the system is then specified by an (ordered) sequence of 
numbers rifj for all ti, w = {nj3{t))}. For only one property 
(no index a, /3) it can be viewed as a chain of bits {n{t)} 
taking values or 1, one bit for every ti. On the other 
hand, if we interpret np(ti) for a given ti itself as sequence, 
a state w is a sequence of sequences. A classical statistical 



ensemble is specified by a probability > for every state 
UJ, with Y^ujP^ = 1- 

Classical observables A have a fixed value in every 
state of the ensemble. Their expectation values obey 

(3) 

We concentrate on the "basis observables" (t) which can 
take the values (^Na{t))^ = 1,0 according to 

{N^{t))^ EE N^{t){{nf,{t')}) = n^{t). (4) 

In other words, Na{t) "reads out" the bit a at a given point 
t of the sequence of bits {ri^(i')}: for a given sequence it 
takes the value 1(0) if this sequence has for the particular 
values a and t the number na{t) — 1(0). Expectation 
values and classical correlations can be computed according 
to 

= ^ p{{nf}{t')})nc,^{ti)...na^{tp). (5) 

{nf,(t')} 

We can use the basis observables Na{t) in order to con- 
struct "composite observables" by taking linear combina- 
tions or products of the basis observables. 

We assume a notion of "completeness" by postulating 
that all properties of reality are described by the proba- 
bilities and that no additional information is available. 
We therefore deal with a genuinely probabilistic theory. We 
further assume in this paper that all physical observables 
can be constructed from the occupation numbers Na(t). 
Then our setting is the most general classical statistical set- 
ting for states where the Na{t) have fixed values. If there 
would be distinctions between two states llIi,uj2 which go 
beyond different values for some of the Na{t), this would 
not affect the expectation values of physical observables. 
We can group lui and uj2 into a "combined state" w, with 

Puj PuJi PuJ2 ■ 

The probability distribution {pu} can be characterized 
by the classical correlation functions C'^p^ given by eq. ([5]). 
Let us assume first a finite number of points, i — 1 . . . T, 
and a finite number of "properties" or "species" , a = 
1 . . .B. We then have a total number R = TB of "bits" 
na{t). The classical correlation functions can be linearly 
independent only for p < R. Indeed, Na{ti) is a projection 
operator 

{N^{t^)f ^ N^{tO (6) 

such that every factor Na{ti) can appear at most once in 
the independent correlations. The correlation with p = R 
is unique, since it contains every Na{ti). There are R cor- 
relation functions with p = R— I, which can be labeled by 
the R possible places where one Na{t) is missing from the 
chain of all Nij{ti). We can define the conjugate observable 

N^{ti) = l-N^{t,) (7) 
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which takes the value (-/Vc(<i)) = 1 for all states w where 

(Ncik))^ = and (N^ik))^ = if (A^a(t^))„ = 1. One 
may then relate the correlation functions with p factors of 
N to the correlation functions with p = R — p factors of N. 

We define the manifold V of all possible probability dis- 
tributions {pui} and C the manifold of all independent clas- 
sical correlation functions. Eq. ^ constitutes a map 
V ^ C. This map is injective since two probability dis- 
tributions which yield the same value for all correlation 
functions are identical. Let us denote the image of V in 
C by Cp. We can then find the inverse map Cp ^ V . In 
other words, the probability distribution {pui} can be re- 
constructed from the classical correlation functions. 

We can easily visualize the situation in the simple cases 
of only one or two bits [R — 1,2). For i? = 1 we have only 
two states n = and n = 1, with associated probabilities 
P- and p+ . The only "correlation" is the expectation value 
{p=l) 



(N) =p+-p- 



With p+ + p_ 



1 this yields 
1 



P± 



il±{N)). 



(8) 



(9) 



For two bits (?t.i,?t.2) we have four states, uj = 
[(1,1), (1,0), (0,1), (0,0)], with probabilities = 
,p ]. The expectation values or "occupa- 
tion numbers" obey 

(Ni) = P++ + p+_ , {N2)^ P++ + P-+. (10) 

There is one independent two-point correlation {p = 2) 



{N1N2) = p+ 



+ ■ 



(11) 



Together with '^^Pui = 1 we can compute the proba- 
bilities pu, from {Ni),{N2) and {N1N2). For three bits 
we have three expectation values (Ni) , {N2) , {N3) , three 
two point functions {N1N2), {N1N3), {N2N3) and one three 
point function {N1N2N3). The values of the seven corre- 
lation functions are sufficient for the reconstruction of the 
seven independent probabilities Pi^. This generalizes to an 
arbitrary number of bits R. We have 2^ different states 
Lu, 2^ — 1 independent probabilities p^, and also 2^ — 1 
independent correlation functions. 

The classical correlation functions obtained from eq. ([SJ 
have to obey constraints - these constraints define Cp. Ob- 
viously, all correlation functions obey the bounds 



o<cfJ,j({U)<i- 



(12) 



There are further constraints. For the example of two bits 
the connected two-point-function obeys 



G(2) = {N1N2) - {Ni){N2) 

= P++P — - P+-P-+ e 



1 1 

4' 4 



(13) 



This restriction holds for the connected two-point-function 
also for an arbitrary number bits. If we single out two par- 
ticular bits ni and ri2 out of the R possible bits, we can 



compute the correlations pO)) . pT|) by "integrating out" 
the bits different from ni and 712. This yields an effective 
probability 15++ for finding iVi — l,N2 — 1, which is ob- 
tained by summing over all probabilities for those states 
for which the sequence {na{ti)} has for ni and n2 the value 
one. Similarly, we get p^ — ,p — |- and p--, and X^t^P^^ = 1 

translates to + Ph + P h + P = 1- The relations 

pU)) and (|lll) follow in a straightforward way. 

The map V ^ Cp can be used for a characterization 
of arbitrary statistical systems. If the classical correlation 
functions of the system obey constraints such they belong 
to Cp for an appropriate i?, this system corresponds to a 
classical statistical ensemble with R bits. For this ensem- 
ble, the probability distribution {pu:} can be constructed 
from the full set of classical correlation functions. The 
constraints defining Cp may be used in order to test if a 
given statistical systems as a quantum system or Grass- 
mann functional integral, can be represented by a classical 
i?-bit system. More precisely, the test concerns both the 
existence of the classical ensemble and the use of classical 
correlation functions ([5]). If correlation functions different 
from the classical correlations are used, the system may 
still be represented by a classical ensemble even if the test 
fails. This is important, for example, for the representa- 
tion of quantum systems as classical statistical ensembles, 
where the correlation functions correspond to conditional 
correlations different from the classical ones ^13] . 

The all-time ensemble is described by "complete statis- 
tics" in the sense that all classical correlation functions 
for all observables that are constructed from Na{t) can be 
computed from the all-time probability distribution {pu}- 
We will see later in this paper that this property is lost if 
we describe "subsystems" as the local-time subsystem dis- 
cussed in the next section. Subsystems obtain by a "coarse 
graining of the information" . Subsystem observables are 
those for which the expectation values can be computed 
from the information available for the subsystem. There 
is no guarantee that the classical correlation for two sub- 
system observables is itself a subsystem observable. We 
will discuss this issue which is very important for an un- 
derstanding of quantum mechanics in more detail in sect. 



III. LOCAL-TIME SUBSYSTEM 

In this and the next section we introduce the notion of a 
local-time subsystem. For this purpose one infers from the 
all-time probability distribution {puj} a local-time proba- 
bility distribution {prit)} for any given time t. A state 
of the local-time subsystem is given by the sequence of 
{Prit)} for all times t. The local-time probability distribu- 
tion {prit)} amounts to the usual definition of a statistical 
ensemble for which the probabilities relate to a particular 
time. The time evolution law for this ensemble will specify 
how {prit + e)} can be computed from {pr{t)}. The local- 
time subsystem comprises all sequences {prit)} that obey 
the same evolution law. The different states of the local- 
time subsystem correspond to different sequences {prit)} 
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that all realize the given evolution law. 
1. Locality in time 

So far our considerations hold for arbitrary bits without 
a distinction of the "indices" a and ti. We could consider 
the pair 7 = (a, i^) as a "collective index" labeling all the 
R bits. We next use the split between "internal indices" a 
and "time indices" ti in order to define local probabilities 
Pr(t), where "local" means local in time. This is done by 
summing over all sequences [n^(f')] with t' different from 
t, 

t'^t p n^(t')=0,l 

(14) 

The local probabilities depend on the sequence of bits 
[rt/3(i)] at a given time, where r = [?T.^(t)] can take J\f = 2^ 
values. They are normalized 

Y^Prit)^ E P{[Mt)]-t) ^ 1- (15) 

r [nuit)] 

The index r = 1 . . .J\f counts the number of "local states" 
at a given time t. In eq. (jl4p we have used a notation 
where t singles out a particular time, while t' extends over 
all other times, i.e. 

oj = {np{t');nfs{t)} = {[n^{ti)],[n0{t2)],..., 

IMt - e)], Mt + e)] . . . [npitf)]; Mt)]}. (16) 

For the local-time probability distribution we use the 
equivalent notations 

Pr{t)=p{[np];t) =p{[np{t)];t) ee p([n^(i)]) . (17) 

The observables Na{t) at a given time t play the role 
of local observables. Their expectation values and "equal- 
time correlation" can be computed from p^ (t) , 

= J2 p{[np{t)];t)n^,{t)...n^^{t). (18) 

["/3(t)] 

The general discussion for correlations is the same as above, 
except that all Na are now taken at the same time t, and 
R is replaced by B,a = 1 . . .B. Integrating out the "past 
degrees of freedom" for t' < t and the "future degrees of 
freedom" for t' > t is formally straightforward, using the 
definition of the sums 

E =nn E 
= (n E )(nn e ) 
- E (nn E )• (19) 

[^^(t)] t'/t (3 np{t')=0,l 

Using eq. we obtain eq. from eq. ([5]). 



We will also use the notion of "extended locality" by 
considering a time interval ta < t < tb, with 

p{{np{t)}) = (20) 
n n E p{{Mt' < ta) , Ufsit) , np{t' > h)}). 

t'<ta t'>tb np{t') 

The states {n^(t)} are now sequences of np for all allowed 
t within the interval. Correlation functions of observables 
Na{t) at different times within the interval can be com- 
puted from the information contained in p(|n^(r)}). 

2. Periodic local probabilities 

The reader may get familiar with the concept of local 
probabilities by the discussion of a few simple examples. 
We discuss here periodic local probabilities, since they will 
play an important role in later parts of this paper. We 
concentrate first on only one species, B = 1. Assume that 
the extended local probability density p({n(i)}) obeys the 
relation 

n E p{{^Miho}) 

t^ta,tb n(t)=0,l 

= n E P({0, «(*),!}), (21) 

t^ta,ti, n(t)=0,l 

where the first and last entry in the sequence refer to the 
values of n{ta) and n{tb)- In this case one finds for the 
local probability pi{t) = p{n{t) = l) 

Piih) ^piita). (22) 

Indeed, the two expressions 

= n E {p{^,ri{i),l)+p{l,n{i),0)^ 

i^ta,tb n(t)=Oa 

P^i^b) = n E (p(l,"(t),l) +P(0,n(i),l), 

i^ta,tb n(t)=Oa 

(23) 

are equal by virtue of eq. (PTjl . If the relation (PT|) holds for 
all intervals with fixed size, th — ta = f , one concludes that 
Pi{t) is a periodic function of t with period f,pi{t -I- f ) = 
Pi{t). 

As a first example we take a time chain with only four 
points {T — 4). We choose the probabilities 

p(l, 1,1,0) = p(l,0,l,l) = a;, (24) 
p(l, 1,1,1) = y-x , p{l, 0,1, 0)^1 -y-x, 

and zero otherwise, with < y < I , 0<x< max(?;, 1 — y). 
This yields a period 2e for the local probabilities, with 

Pi{ti) = Piih) = 1 

Pi{t2) = Pi{t4)^y- (25) 

For this example the four probabilities piiti) constitute a 
state of the local-time subsystem. We can formally use 
periodic time by identifying U + Ane = ti , n Cz N . 
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The number of constraints that have to be imposed on 
for obtaining a periodic local probability is typically far less 
than the number of independent p^. For T = 4 and period 
2e the two equations of the type (j25p and the normalization 
of the probability distribution amount to three constraints 
for the 2'^ probabilities p{ni, n2,n^, n^). We conclude that 
many different probability distributions {puj} can lead to 
periodic local probabilities. The particular case (pS)) im- 
poses p(0, 712, 71-3, 71,4) = 7^-2, 0, 714) = and therefore 
leaves only four nonvanishing probabilities. Then the prob- 
abilities ([M]) are the most general solution of eq. (1^51) . Still, 
a given local-time subsystem Pr{t) (pSj) is obtained for a 
whole family of all-time probabilities pu: = ^(711,712,713, 714) 
that is parametrized by a continuous parameter x. The 
map {pui} — i' {Prit)} is not invertible. A state of the local- 
time subsystem retains only part of the information that 
is available for the all-time probability distribution. 

A simple particular way of constructing all-time proba- 
bility distributions leading to periodic local probabilities 
consists in "gluing" extended local probabilities. Con- 
sider extended local probabilities p({7i^(f)}) in the range 
ta l£ t < tb = ta + f — e. We may define an all-time proba- 
bility distribution with t in the range ta < t < ta + mf — e 
by the product form 

p{{np{t)} , {n^{t + f)} , ...{np{t + {m-l)f)}) (26) 

= P{{np{t)})p(^{nf,it + f )}) . . .p(^{npit + (m - l)f) }. 

Using the normalization of the extended local probabili- 
ties p (for every fixed k, and with the product over t in 
the range of the corresponding extended local probability 
distribution), 

n E p{{npit + kf)})^l, (27) 

t+kf ni3(t+kf)=0,l 

we obtain for the extended local probabilities in the interval 
ta+nr < t + nf < f-|-(7i-|-l)f— e the distribution -|- 
nf)}), which is the same for all n. 

The local probabilities are periodic with period f , 

Pr{t + nf) ^Pr{t). (28) 

Here we consider the states r = [ri^] as given bit sequences, 
with n/3 corresponding to 77^3 (i) or 77^(i + 77f) from the 
perspective of the all-time probability distribution. The 
periodicity follows from the general definition (|14l) of the 
local-time probability distribution, employing eq. (j26|) for 
{pui}- Time layers outside the extended time interval to 
which t + nf belongs are integrated out trivially. The only 
remaining sums concern the times that belong to the same 
extended time interval as t-|-77'f , but differ from this specific 
value, 

p(^[nf3(t + nf)]^ 

= n E p{{n0it + nf)},[np{t + nf)]y 

i-{-nf^t-\-nf n0 (i-\-nf) 

= p{[np{t)]). (29) 



The last line uses the fact that the extended local proba- 
bility distributions p are the same for all n. 

As an example with B — 1 , f — 2e, ta = ti , t^ = t2 — 
ti + e we may take 

p(l, 1) = y , p{l, 0) = 1 - y , p(0, 0) = p(0, 1) = 0, (30) 

and consider the product ((26l) for m = 2 

p(l, 1,1,1) = y2 ^ p(l,0,l,0)-(l-7/)2, 

p(l, 1,1,0) = p(l,0,l,l)-y(l-2/) (31) 

(with vanishing p otherwise). This is precisely the distri- 
bution ([M)) for the special value x = y{l — y). 

Probability distributions {puj} leading to arbitrary pe- 
riodic local probabilities Prit) can be always constructed 
by gluing extended local probability distributions. These 
are, however, by far not the only possibilities. The all-time 
probability distribution (IBip is only one particular member 
of the family of probability distributions (|24p that lead to 
the same local-time subsystem (|25l) . 

In this context we note that for glued all-time probability 
distributions the local-time probabilities Pr (t) can be com- 
puted from the extended probability distribution in the 
time interval to which t belongs. This holds for the whole 
sequence {prit)} for times belonging to this interval. Dif- 
ferent extended probability distributions p can lead to the 
same sequence of local probabilities {prit)} within their as- 
sociated time interval. An all-time probability distribution 
that realizes given periodic probabilities (pS)) can therefore 
also be obtained by gluing different p that all lead to the 
same {prit)} within their time interval of definition. 

We discuss next a further example with B = 1 that we 
will use later in this paper. We consider an extended local 
distribution in the range ta < t < ta + for which the 
non-vanishing probabilities are given by 

p(l, 1, 0,1) =p(l, 1,0,0) =p(l, 0,0,1) -p(l, 0,0,0) = i. 

(32) 

By gluing according to eq. (|26l) we obtain a probability 
distribution which leads to a periodic local probability with 
period f = 4e, namely 

Pi{t) = cos^ Ldt , a; = — , (33) 
r 

where t = {n — l)e. The state of the local-time subsystem 
is described by the sequence of local probabilities (with 
integer k) 

Pi{kf) = 1 , pi(kf + e) = i, 

Pi(fcf + 2e) = , piikf + 3e) = i. (34) 

Again, there will be many different all-time probability dis- 
tributions {puj} that lead to the same state of the local-time 
subsystem ([M]) . 

We can extend this setting to a finer grid, f — 8e, while 
keeping eq. p3p unchanged with fixed f and w. We have 
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now to construct extended local probability distributions 
for eight time steps such that Pi(je) — cos^(j7r/8), j = 
. . . 7. While the solution of the system of nine equations 
for the 2^ probabilities p({n(f)}) becomes cumbersome in 
practice, it is obvious that many such solutions exist. Dif- 
ferent solutions can be glued together in order to obtain a 
local-time subsystem obeying eq. (I33|) . Even these many 
possibilities are only a small subset of the all-time distri- 
butions {pi^} that lead to the same state of the local-time 
subsystem p3p. In a similar spirit we can construct prob- 
ability distributions with a periodic local-time subsystem 
([55)1 for arbitrary small values e/f. We can also take the 
limit e — for fixed f and w. We will see in sect. IVIIII 
that that the simple state of the local-time subsystem ([33l) 
describes two-state quantum mechanics. 

3. System observables 

The all-time probability distributions {p^;} that lead to a 
given state of the local-time subsystem (l33l) define standard 
classical statistical ensembles. Let us now assume that only 
the information contained in the sequence of local proba- 
bilities {pr(0} is available for predictions of the outcome 
of measurements. This is a familiar situation, since usually 
all quantities are computed from time-dependent probabili- 
ties, while an all-time probability distribution is not known. 
We therefore have to ask what type of quantities remain 
computable from the limited information contained in the 
state of the local-time system {^^-(t)}, e.g. the periodic 
probabilities ([55)1 . 

"System observables" are those classical observables of 
the all-time ensemble for which the probabilities Wa for 
finding a value \a are computable in terms of the infor- 
mation available for the local-time subsystem. In partic- 
ular, the expectation values remain computable in terms 
of the sequence of local probability distributions 
This extends to expectation values of arbitrary functions 
of a system observable. If the spectrum of an observable 
contains only two values \a it is sufficient that the expec- 
tation value is computable from {pT-(t)}. For observables 
A with a spectrum containing more than two values Aa we 
have to require, in addition, that higher powers of A have 
computable expectation values, i.e. (A), (A^), . . . , etc. are 
computable from {^^-(i)}. 

The occupation numbers Na{t) remain all system ob- 
servables. This extends to the equal time correlation func- 
tions (fTS]) . However, classical correlation functions involv- 
ing occupation numbers at non-equal times are, in general, 
no longer computable from the information contained in 
the local-time subsystem. For example, the classical ob- 
servable Na:{ti)Np{t2) for <i 7^ t2 is not a system observ- 
able. 

One concludes that the local-time subsystem is charac- 
terized by incomplete statistics. While both Na{ti) and 
Np{t2) are system observables, their classical product is 
not. This has important consequences for the definition of 
suitable correlations that describe a sequence of two mea- 
surements in the subsystem, as we will discuss in detail in 
sect. IIXI If wc take the attitude that only the information 
contained in the sequence of local probability distributions 
{pT-(i)} for the local subsystem is available for an observer. 



the correlation between a first measurement of Na at time 
ti, and a second measurement of at time t2, has to 
be computable in terms of {^^.(t)}. This implies that the 
appropriate "measurement correlation" cannot be given by 
the classical correlation function. 

4. Equivalence classes for classical probabilities 

The sequence of local probabilities p([n^(ii)]) for all 
times ti can be used for the definition of an equivalence 
class for the classical probability distribution {puj}- For 
any given {pcj} the sequence of local probabilities can be 
computed in a unique way by eq. (HH). Two probabil- 
ity distributions {pui} and {p^^} are considered as equiva- 
lent if they yield the same local probabilities p([ri^(ii)]) 
for all ti. For a large number of time points ti the num- 
ber 2"^^ — 1 of independent real Pi^, which are needed to 
define the distribution {pu}, exceeds by far the number 
T{2^ — 1) of independent real probabilities Pr{ti) needed 
for the specification of the equivalence class. Typically, a 
given equivalence class contains many different probability 
distributions {p^}- We will see later that for many pur- 
poses the relevant properties of the system only depend on 
the equivalence classes of sequences of local probabilities. 
These equivalence classes correspond to the states of the 
local-time subsystem. 

We may also show that for each arbitrary sequence of lo- 
cal probabilities there exist classical probabilities {pu>} that 
realize this sequence. (None of the equivalence classes is 
empty.) We proceed by induction, as we may first demon- 
strate for B ^ 1. Assume that for m time points of a 
sequence of local probabilities one has found a probability 
distribution p^™) ('^(^1) , ^^(^2), • ■ • , "-(^m)) which yields all 
the wanted local probabilities pi{ti) = p{n{ti) = l). We 
can then construct 

p(™+i)(n(ti),...,n(t„),0) = (35) 
p(™)((ti),...,n(t,„))(l-pi(Wi)), 

in order to continue the sequence of local probabilities with 
an arbitrary pi{tm+i)- For the first member {m = 1) of 
the induction chain we can take p(n{ti) = l) = pi{ti), 
p{n{ti) = 0) =l-pi(ti). 

This procedure can be generalized for arbitrary B. The 
first step starts with p'^^ — Pi{ [^1,3(^1)] • In the second step 
the product 

P^^\[Mti)],[np{t2)]) = Pi[[npiti)])p2[[npit2)]) 

(36) 

realizes the sequence of local probabilities for ti and t2, 
namely pi([n/3)(ti)]) , P2 ( ['1-/3(^2)] ) • This follows directly 
from the normalization (I15p . The third step multiplies p^"^^ 
by Pa ([jT- ,9(^3)]) and so on. We recall that this particu- 
lar construction only constitutes an existence proof that a 
suitable {pi^} exists for every sequence of local probabil- 
ities. Many other and less trivial {pu;} lead to the same 
sequence. 
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IV. EVOLUTION LAW 

We next postulate as a key ingredient for a time struc- 
ture that the local probabilities Prit) can be expressed in 
terms of the neighboring local probabilities Pr{t — e). This 
relation will be called the "law of evolution" . The all-time 
probability distribution = p({fi/3(i)}) contains statisti- 
cal information for arbitrary times, past, present and fu- 
ture. The existence of a law of evolution is a restriction on 
the allowed {puj}. The law of evolution allows us, in prin- 
ciple, to compute correlation functions at some "present 
time" t from probability distributions in the past without 
using any information concerning the future. It is suffi- 
cient to know the probability distribution at some "initial 
time" to, and to solve the law of evolution in order to com- 
pute Prit') for to < t' < t. Most important, the evolution 
law allows us to make probabilistic predictions for future 
expectation values and equal time correlations with time 
arguments tk > t, using only statistical information from 
the present or past correlations. Also the (extended) local 
correlation functions can be determined using much less 
information than contained in the "overall probability dis- 
tribution " {pu:}- If we are interested only in times > to 
the existence of an evolution law is only needed for t > tQ. 

1. Simple periodic evolution 

As an example we consider B — 1 and the periodic state 
([55)) of the local-time subsystem given by pi{t) = cos^wt. 
One can obviously write down a differential evolution law 
to which pi{t) is a solution, as d^pi — —co'^pi. This in- 
volves, however, a second time derivative such that the 
computation of pi (t) requires knowledge of pi {t — e) and 
Pi{t — 2e). We are interested in an evolution law that in- 
volves only one time derivative, such that pi{t) = pi{t — 
e) -|- edtPi {ti — e) is computable from pi (t — e) if dtPi can 
be expressed in terms of pi . Such an evolution law will no 
longer be linear in pi , since 

dtPi = —2uj cosujtsinojt — —2ujsy^pi{l —pi). (37) 

The sign s = ±1 depends on t, with 5 = 1 for t in the 
interval [0,7r/2a;] and s = —1 for t G [7r/2w, tt/w], and pe- 
riodically continued with period t — n/uj. The sign jumps 
whenever pi or 1 — pi equal zero, resulting in dtPi = 0. 
We need this prescription for the sign changes of s in order 
to obtain the correct evolution law. The computation of 
dtpi{t) not only needs pi{t), but also the information if s 
is positive or negative for a given time t. 

A convenient form to combine the information in pi{t) 
and s{t) is the two-component real wave function 

qi{t) = s(i)VwW, 

qo{t) - s{t)s{t)^l~pi{t), (38) 
which is normalized according to 

ql{t)^ql{t) = l. (39) 
The evolution equation 



describes a simple rotation among the two components of 
a unit vector. The overall sign s in eq. (|36|) does not 
affect the evolution equation (|40l) . but the relative sign s 
between qi and go matters. (For definiteness we may take 
s = 1 for t in the interval [—■n/2uj, t: /2lj\ mod f , and s = —1 
otherwise.) 

The evolution equation (j40|) is linear and avoids the ex- 
plicit appearance of a sign factor. It is straightforward 
to compute gi [t) and qo (t) from eq. pOt if qi [t — e) and 
qo{t — e) are known. A first order evolution equation is 
obviously formulated much easier in terms of the real wave 
function (gi(i), (3'o(i)) than in terms of the probabilities. 
We will argue next that evolution equations of the type (|37l) 
or (|40|) arise naturally in the context of periodic probability 
distributions. We will do so by showing first that first or- 
der evolution equations which are linear in the probability 
distribution p{t) usually do not lead to periodic probabili- 
ties. 

2. Transition matrix 

For a general discussion of possible evolution laws we 
first investigate a "linear law of evolution" which can be 
written in terms of a "transition matrix" W 

Pr{t)=Y,Wrp{t.t~e)pp{t~e). (41) 
p 

If W obeys 

Y,Wrp = l, 5]W^rp = l (42) 
r p 

and 

Wrp > 0, (43) 

the matrix element Wrp can be interpreted as the condi- 
tional probability to find the state r at time t if the state 
p is realized at time t — e. Eq. (I43p guarantees that condi- 
tional probabilities (or transition probabilities from p to r) 
are positive, and the first eq. (|^^ ensures the correct nor- 
malization, namely that one of the states r must be found 
at t if /5 is realized at t — e. Eq. (|411) states that the proba- 
bility to find at t the state r equals the sum over all states 
p aX t — e, weighted with their respective probabilities and 
the transition probability from p to r. The second eq. (|42p 
states that every state t at t origins from one of the states 
p at t—e with probability W-rp, and that these probabilities 
sum up to one. Even though the linear evolution law (j4ip - 
(j43p is very intuitive we will see that it cannot describe the 
oscillating probabilities that are omnipresent in nature. 
The evolution law (j4ip entails a composition property 

Prit) = ^I^rp(i,t-e)W^p^(i-e,i-2e)p,(t-2e) 

= Y,Wra{t,t-2e)p^{t-2e). (44) 

cr 

This can be "integrated" , and we obtain for t^ > t2 > ti > 
to the matrix multiplication 



dtqi = -Loqo , dtqo = ojqi 



(40) 



W{t3,ti) = W{t3,t2)W{t2,ti), 



(45) 
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with 

W{t,t) = l. (46) 

We can then express Prit) in terms of an "initial" proba- 
bihty distribution p^(to), 

p,(t) =^W^,p(i,to)Pp(io)- (47) 
p 

We may interpret the probabihties pr (t) as components of 
a vector in R^. The basis vectors of R-^ are the "classical 
states" p for which p'f'' = S!^. The transition matrices act 
then as operators in this space. 

In turn, each classical state is a sequence of of occupa- 
tion numbers for B bits, and we can characterize W by its 
operation on the bits. Consider the example of two bits 
(B = 2,AA=4), wherep= [(1, 1), (1, 0), (0, 1), (0, 0)] . We 
can compose as a sum of "elementary bit operators" 

W = Y1 ^"^^"^ ' ^' = 1 • • • (A^' - !)• (48) 

V 

A typical bit operator is the annihilation operator for 
bit 1 which changes ni = 1 to ni = and yields zero 
when applied to a classical state for which ni = 0. It maps 
(0,0) ^ , (0,1) -> , (1,0) ^ (0,0) , (1,1) ^ (0,1) 
and corresponds to 



/0,0,0,0\ 




/0,0,1,0\ 


0,0,0,0 




0,0,0,1 


1,0,0,0 


0,0,0,0 


V 0,1,0,0/ 




V 0,0,0,0/ 



(49) 



The hermitean conjugate (-Bi^"*)^ = -B^j.^'' is the creation 
operator for bit 1. Other operators are the switch between 
two bits (0,1) 4-> (1,0), the diagonal number operators 
A^i = diag{l,l,0,0), N2 = diag{l, 0,1,0) or their product 
N1N2 = diag{l, 0,0,0). The coefficients Ky have to obey 
restrictions in order to obey the relations (H^ An 
example is W = B^^ + B^_^\ with = 1. 

The simple periodic oscillations of a local two-state prob- 
ability {B — 1), as given by eq. ([551) . cannot be expressed 
by transition matrices obeying eqs. (|42p . (P5| . Indeed, the 
most general form of WVp for B — 1 reads 

^=('7:i-p)- <»' 

If we characterize W\,W-2. by pi,P2, the product W = 
W1W2 = W2W1 has the form 1^ with 

p = pi(l -P2) +P2(1 -pi). (51) 

Thus Wi does not admit an inverse except for pi — 0,1, 
since p differs from zero for arbitrary p2 . An inverse of W 
would be needed if we want to represent the time reflected 
evolution of the local probabilities - we note that eq. (155)) 
is invariant under t ^ —t. 



If we want to realize a continuous evolution in the limit 
e — )• we should have for W{t, t — e) an infinitesimal form 
p = Ae , A ^ 0. The time evolution obeys then the differ- 
ential equation 

dtPi ^ j{pi{t) - pi{t - e)) 

= ^[{Wii-l)piit-e) + Wio{l- pi [t - €)) } 

= -2^ [pi - ^ . (52) 

For A > this corresponds to an irreversible approach to 
the fixed point at pi — pq — 1/2 rather than to an oscil- 
lation. Similar problems persists for a higher number of 
species B and we conclude that an evolution law involving 
transition matrices is often not suitable for a description of 
oscillating local probabilities. There are exceptional cases 
where a linear law of evolution can describe periodic prob- 
abilities and we will mention a few of them later in this 
paper. 

3. Unitary time evolution 

If we want to find a general description of oscillating 
probabilities we have to look for an alternative to the lin- 
ear evolution law (|4T|) - (|43|) . Indeed, there is no need for 
the relation between {prit)} and {pp{to)} to be linear as 
we have seen for the simple example ([57]). An interesting 
alternative to the transition matrix (j47p is a time evolution 
of the local probabilities according to 

Prit) =^Vrpa{t,to)^Jpp{tQ)paito), (53) 

p,<7 

where VVpo- obeys 

Y.VrpAt,to)=6p^. (54) 

r 

This can be cast into the form of an evolution law by taking 
tQ = t — e. It is the type of evolution law obeyed by the 
periodic local probability (|33|) . For only one species {B = 
1) the indices T,p,a take values (1,0) with po — 1 ~ pi. 
It is straightforward to verify that the evolution law ()53l) 
with Vrpait, t- e) given by 

Viii = Vooo = cos^(a;e) , Vim = Von = sin^(tje), (55) 
V^iio = V^ioi = -Voio = -Vooi = -s{t ~ e) cos(a;e) sin(i:je), 

is obeyed by the local probability ([55)) . The sign s(t — e) = 
±1 is positive for the first half period, < t — e < f/2 
and negative for the second, f/2 < t — e < f, with periodic 
continuation. 

The non-linear evolution law ()53p corresponds to a sim- 
ple evolution law for the classical wave function, which 
amounts to the root of the probability distribution up to 
signs. Indeed, simple evolution laws are most easily formu- 
lated in terms of the classical wave function {qr}- For an 
evolution that preserves the norm of the probability distri- 
bution it is rather natural to consider a unit vector q with 
components 

qr = Sr^/p^ , Sr = ±1. (56) 
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The conserved normalization '^^Pr = 1 corresponds to the 
unit length of g, = J2t It ~ J2tPt ~ ^- construc- 
tion the inequality < < 1 is guaranteed for arbitrary 
Qt- Any time evolution that preserves the normalization 
of the probability distribution must keep the length of the 
vector q fixed and therefore corresponds to a rotation of q, 

(lr{t) = ^Rrp{t,tQ)qp{to), 
9 

q{t) - R{t,tn)q{tn) , = 1. (57) 

The simplest form of an evolution law is linear in q, i.e. 
the rotation matrix R does not depend on q. This is re- 
alized by the simple periodic probability distribution p3p 
discussed in sect. IIIII We will assume in the following that 
R is independent of q. Linearity in q entails, however, non- 
linearity in {pt} of the specific form of eq. (j53p . where the 
coefficients Vrpait, io) can be related to the rotation matrix 
RrpitjtQ) up to a sign 

Vrpa{t,to) ^ Rrp{t,tn)RTa{t,to) Sign ((7p(to )'?<t (^o)) • (58) 

Eq. ((M)) follows then from R^R = 1. For the example 
dSll), ([55]) one has for R{t, t - e) 

i?ii = i?oo = cos(a;e) , i?oi = — ^lo = sin(a;e). (59) 

The two-component vector qrit) is given by qi{t) = 
cos(u!t), qo(t) = sin(a;i). 

We will see later that many quantum features can be 
related to the evolution law (1571) which is linear in the clas- 
sical wave function q. It is therefore worthwhile to point 
out that no "quantum features" are required or "put in" in 
the formulation of eq. ([57)1 . It is simply the most straight- 
forward way to realize a norm-conserving time evolution of 
{Prit)} and to realize oscillating probability distributions. 
We also emphasize that the classical wave function is real 
and that the phases characteristic for quantum mechanics 
have to emerge at a later stage, cf. sect. IXIII 

The rotations (j57p lead to a simple composition law, now 
in terms of a matrix multiplication of the rotation matrices 

R{hM) ^ R{hM)R{t2M)- (60) 

An inverse rotation matrix always exists such that a time 
reflected evolution can be realized. Probability distribu- 
tions p({n^(ti)}), which obey the evolution law ([55]) . ([57)) . 

have all the properties necessary for consistently de- 
termining the future from the past in a probabilistic way, 
as described in the beginning of this section. 

We will call the evolution law ([57l) a "unitary evolution" , 
since rotations are a special case of the more general uni- 
tary transformations. In the remainder of this paper we 
will concentrate on unitary evolution laws. Obviously, a 
unitary time evolution is well suited for the description of 
oscillating probabilities. It is sufficient to use periodic ro- 
tations. The infinitesimal form of the unitary evolution law 
(|57l) reads 

dtq^Kq , K^{dtR)R-\ (61) 



The matrix K is antisymmetric. 

4. States of local-time subsystem 

The local-time subsystem is characterized by the se- 
quences of local-time probabilities {prit)} that obey a spe- 
cific evolution law. A state of the local-time subsystem 
corresponds to a particular sequence. Formally, each state 
involves the information contained in many local probabil- 
ity distributions {p^(i)}, one for each time. 

The presence of an evolution law greatly reduces the in- 
formation needed for the specification of a state, the rea- 
son is the restriction on the allowed {prity} imposed by the 
evolution law. For an evolution law formulated as a first 
order differential equation the allowed {prit)} must be so- 
lutions of this equation. As an example we may consider 
the local-time subsystem which is defined by the evolution 
equation (1371) . The necessary information for the specifi- 
cation of a state is the probability pi(tin) for some initial 
time tin, the sign s{tin), and the rule that the sign switches 
whenever dtPi = 0. This is somewhat cumbersome due to 
the explicit appearance of the sign function s{t). 

For a unitary time evolution law the specification of a 
state becomes much easier by use of the classical wave func- 
tion {gT-(t)}. Indeed, any particular solution of the linear 
differential equation (l40l) or ([57)) is uniquely determined by 
the wave function {qriUn)} at some arbitrary initial time 
tin- In short, the local-time subsystem is defined by the 
matrix K in eq. (pT|) . and a particular state of the local- 
time subsystem can be characterized by the classical wave 
function {qr{t)}. In turn {q^(t)} is uniquely determined 
by an initial value {qriUn)}- 

5. Equivalence classes for classical observables 

The system observables define equivalence classes for 
classical observables. Two different classical observables 
A and A' , with different values A^ and A'^ in the states of 
the all-time ensemble, are considered as equivalent if they 
are both system observables and if they have the same 
probabilities Wa for finding a value Aa for all states of the 
local-time subsystem. Two equivalent observables cannot 
be distinguished by any measurement of properties of the 
local-time subsystem. From the point of view of the subsys- 
tem they are identical. Two-level observables have only two 
possible values for Aa. They are equivalent if their expec- 
tation value is the same for all arbitrary sequences {prit)} 
that obey the evolution law which specifies the local-time 
subsystem. 

A simple example may be given for the local-time subsys- 
tem with evolution equation (j40p . corresponding to -R' = 
—iujT2. All allowed wave functions are periodic in time, 
with period 27r/a;. Furthermore, after a period f — tt/cj 
the wave function has only changed its overall sign such 
that the local-time probability distribution {prit)} is peri- 
odic with period f . In consequence, the occupation num- 
bers N{t) and N{t -f f) have the same expectation values 
for all states of the local-time subsystem. They belong to 
the same equivalence class. 

The explicit classification which classical observables be- 
long to the same equivalence class may be quite involved, in 
particular for more complicated evolution laws. The math- 
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ematical reason for the existence of equivalence classes is 
quite simple, however. We may restrict the discussion to 
two-level observables for simplicity. Two classical two-level 
observables A and A' are different if there exists at least one 
probability distribution p^: for which (A) ^ {A'). Let us 
denote by Vi the ensemble of {p^} for which (A) ^ {A'), 
and by 7^2 the ensemble for which (A) = {A'). The two 
observables A and A' are equivalent if all probability dis- 
tributions {puj} that obey the given evolution law belong 

to V2. 

More precisely, we may denote by Vi the ensemble of 
all-time probability distributions {puj} that obey the given 
evolution law. This is typically only a small subset of the 
ensemble of all possible probability distributions {pu;}- The 
complement of {puj} not obeying the evolution law is de- 
noted by Vi . There are many pairs of classical observables 
A and A' such that Vi C 7^2 ■ Such observables have differ- 
ent expectation values only for distributions Pi^ that belong 
to Pi . An observable A is a system observable if it has the 
same expectation value (A) for all p E Vi that lead to the 
same state of the local-time subsystem. In other words, 
(A) depends only on the equivalence class of classical prob- 
abilities that is specified by a given sequence of local-time 
probabilities {pr{t)}. Consider now a pair of two observ- 
ables for which C P2 • If one is a system observable the 
other is as well. The two observables are equivalent. In 
other words, two equivalent classical two-level observables 
are distinguished only by different expectation values for 
some Pu & Vi- 
lli turn, we may characterize a system observable by the 
probabilities Wa({Pr(0}) foi" the possible values Aq for all 
states of the local-time subsystem. This is sufficient from 
the point of view of the local-time subsystem. However, 
no unique classical observable is singled out by this pre- 
scription which fixes only the equivalence class. We will 
see that this situation has important consequences for an 
understanding of quantum mechanics. The system observ- 
ables will be associated with quantum operators. The lack 
of an unique classical observable associated to a quantum 
operator violates a basic assumption of the Kochen-Specker 
theorem [141] . This is why this theorem does not apply to a 
realization of quantum mechanics by a classical statistical 
ensemble (l3| . 



V. QUANTUM FORMALISM 

Our discussion has proceeded purely in the context of 
classical statistics. The all-time probability distribution 
{pu:} defines a standard classical statistical ensemble, and 
the observables are standard classical observables. As 
we have argued in ref. (isj it can be convenient to use 
the quantum formalism also for classical statistical ensem- 
bles. This is based on the "classical wave function" which 
amounts to the root of the probability distribution up yo 
a sign. It is precisely this wave function that appears nat- 
urally in the formulation of a unitary time evolution law. 
The quantum formalism arises therefore quite naturally for 
the description of the local-time subsystem with unitary 



evolution law. 

1. Probability amplitude and Schrodinger 
equation 

Our discussion of oscillating local probabilities has led 
us to the concept of a "probability amplitude" qrit). In 
the limit e we may cast the rotation of q in eq. ([57)) . 
(pTj) into the form of a differential "Schrodinger equation" 

dtq = -iHq, 

H = iK = {dtR)R~^ ^^a,L,. (62) 

z 

Here are the 2^~^(2^ — 1) hermitean generators of the 
orthogonal group SO{N') = 5*0(2-^), which are represented 
here as purely imaginary antisymmetric matrices, normal- 
ized to have eigenvalues or ±1, and az G R. At this 
level the wave function q(t) is real for all t. We can cast 
the Schrodinger equation into the more usual form with 
a A/'/2-component complex wave function iplt) whenever 
H is a linear combination of generators belonging to the 
U{N/2) subgroup of SO{N). (For time-independent Uz 
this can always be realized formally. However, a useful 
complex representation also requires that the observables 
which are relevant for the system are compatible with the 
complex structure.) On the other hand, a complex quan- 
tum mechanical wave function can always be written as a 
real wave function with twice the number of components. 

The rotations R can also be expressed in terms of ele- 
mentary bit operators, now acting on q 

R = Y,PvB^'\ (63) 

V 

The relation R^R — 1 restricts the allowed values for the 
real constants p^. An example is i? = R^ = B^^ + 

b'\^\ R^ = 1, cf. eq. (|49|) . This combined transformation 
is a bit operator which fiips the bit 1,(1,^2) O (0,71.2). 
Similarly, we can also express the Hamilton operator (|62[) 
or the SO{Af) generators Lz in terms of bit operators 
(with imaginary coefficients) . For the special case of a ma- 
trix Rtp which has in every line only one nonzero entry 

one has RrpRra = Wrp^pa = Vrpa , Wrp = {Rrpf > 

, J2t = 1 and pr{t) = J2p WrpPp{to). For this spe- 
cial situation the unitary evolution law (I57p results in a 
linear evolution law for the probabilities. 

2. Maps to quantum mechanics 

For probability distributions that obey a unitary evo- 
lution law one can construct maps to quantum systems. 
This allows to use the quantum formalism for establish- 
ing results for classical probability distributions, as we will 
see in the discussion of equivalence classes below. Typi- 
cally, many different maps to quantum systems exist and 
they are often rather formal and will be suited only for the 
demonstration of general properties. We will see in later 
sections, however, that also realistic quantum systems as 
quantum field theories for fermions can be obtained in this 
way. 

The simplest map uses the real wave function q with 
a purely imaginary Hamiltonian given by eq. (j62p . For 
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a purely imaginary Hamiltonian a real wave function is 
no conceptional restriction, since the time evolution would 
keep the real and imaginary components of a complex 
wave function separated. All classical observables based on 
the occupation numbers Na{t) at a given time are repre- 
sented by real diagonal quantum observables {Na{t))^^, = 
{Na{t)) ^6tt' ■ Their expectation values can be computed 
by the usual quantum rule 

{N^{t)) - {q\NUt)\q) 

= J2qrit){N^{t))^^,qr'{t). (64) 

A second map uses the analogy between Ising spins and 
occupation numbers for fermions. Since the "local states" 
are characterized by a sequence of occupation numbers 
na{t) = (0,1) we can associate a given state r = [nQ(t)] 
with a basis state for a multi-fermion system in the occu- 
pation number basis. Fermions with B — 1 "one particle 
states" can be described in quantum mechanics by a 2^~^- 
component complex vector in the occupation number basis. 
This is equivalent to a 2^-component real vector and we 
take the last bit for a distinction between the real part of a 
complex wave function for = 1, and the imaginary part 
for ns = 0. (A quantum time evolution with a real her- 
mitean Hamiltonian amounts then to oscillations between 
states with = 1 and ns = 0.) 

In the fermionic quantum system the observables are 
usually written in terms of annihilation and creation oper- 
ators Ofc, aj,, fc = 1 . . . i? — 1. With a = (fc, B) they can be 
directly transferred to our system of two-level observables. 
In particular, the fermionic occupation numbers Nk — aj^Ofe 
are represented by diagonal operators. The eigenvalues 0, 1 
correspond directly to the spectrum of the classical observ- 
ables Na{t) for a = k. The total number of fermions in 
the quantum system corresponds to Np — J2k^k- We 
may split k — (x, s) into a space index x and an internal 
index s, with x a vector on a d-dimensional lattice with lat- 
tice distance r\ and volume V . In the limit 77 — >■ 0, — ?> cxd, 
corresponding to _B — >■ 00, our system describes a quan- 
tum type of field theory for fermions in d-dimensions, with 
internal degrees of freedom labeled by s. 

The map to systems has been constructed only on the 
level of the local-time probabilities. The time evolution of 
a "many body wave function" 

V'([n,(f)];t) = (65) 
q{\ns{^)\,nB = l;tj + iq(^[ns{x)],nB = 0;i) 

is given by a Schrodinger equation. The translation of the 
Hamiltonian (|62|) to the complex basis yields a hermitean 
Hamiltonian which has, in general, both real and imaginary 
parts. 

3. Sequences of local probabilities 

For the discussion of this section we do not need the ex- 
plicit form of the "overall probability distribution" {puj}- 
It is sufficient that such a distribution exists, as we have 



established in sect. IIIIl Indeed, the evolution law together 
with the "initial condition" qrito) at some arbitrary to is 
sufficient in order to fix the whole sequence of local prob- 
abilities Priti) for all ti. (For the unitary evolution this 
holds automatically for ti > to and ti < to, since R is in- 
vcrtible. For an evolution according to a transition matrix 
the computation ofpT-(ti < t^) needs the invertibility of H^. 
Otherwise tg should be chosen as the smallest time.) The 
evolution law and the initial data therefore fix the equiv- 
alence class discussed in sect. IIIIl uniquely. For a unitary 
evolution we may therefore label the state of the local-time 
subsystem by the evolution law (the matrix K) and the ini- 
tial condition for qr{to)- (Here we assume some particular 
choice of signs for gr(io)- If two different qrito) and g^(io) 
lead to the same sequence of probabilities Prit) for all t 
they actually belong to the same equivalence class. This 
holds if qrito) and qrito) cannot be distinguished by the 
relevant observables. Otherwise we can refine the notion 
of equivalence classes by basing them on sequences qrit)-) 

We want to know which expectation values and correla- 
tion functions can be computed in terms of the sequences 
of local probabilities Prit), without invoking further infor- 
mation from the all-time probability distribution {pu}- For 
this type of question the mapping to the quantum formal- 
ism is very useful. In quantum mechanics we know that 
the Hamiltonian and the initial value of the wave function 
qrito) are sufficient for the computation of expectation val- 
ues of observables and their quantum correlations at dif- 
ferent times. All these properties depend therefore only on 
the equivalence class of a sequence of local probabilities, 
and not on the detailed all-time probability distribution 
{pu:}- Most of the information contained in {p^} is not 
necessary for the expectation values and quantum corre- 
lations. One may argue that for many circumstances the 
physical properties of a system can be characterized by the 
quantum correlations. In this case the knowledge of the 
equivalence classes given by qrit) is sufficient. In the other 
direction one may postulate that the outcome of measure- 
ments should only depend on the equivalence classes of 
sequences of probability distributions. Then the quantum 
correlations are good candidates for a description of the 
outcome of a sequence of measurements. 

We emphasize in this context that the quantum corre- 
lations are based on conditional probabilities as discussed 
in detail in [l^ . They are different from the "classical cor- 
relations" defined by eq. ([5]). Indeed, two different prob- 
ability distributions {puj} and {p'^j} which belong to the 
same equivalence class typically lead to different classical 
correlations. If only the information concerning the equiv- 
alence class of sequences of local probabilities is available, 
the classical correlations are often not computable. On the 
level of the local probabilities we deal then with "incom- 
plete statistics" [l2l.[l3l|. 



VI. GRASSMANN REPRESENTATION 

The analogy with fermions motivates the formulation 
of our Ising-type classical statistical ensemble with states 
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characterized by the values of two-level observables in 
terms of Grassmann variables. This leads to simple ex- 
pressions for the bit-operators. Besides the technical ad- 
vantages of the Grassmann formulation the next sections 
will show how the basic concepts of quantum field theory, 
namely fermions, quantum states and unitary time evolu- 
tion, arise in a natural way from our setting of probabilistic 
time. Once fermions are implemented, bosons can easily be 
constructed as suitable composite fields for fermion bilin- 
ears or more complicated bosonic composite fields. This 
may include gravity and gauge interactions as in spinor 
gravity [Tg} . 

1. Grassmann algebra 

In this section we construct the representation of states, 
probabilities, observables and evolution law in terms of a 
Grassmann algebra. We restrict our discussion here to the 
"local probability distribution" at a given time 

t. To each bit we associate a Grassmann variable "tpaTCt = 
I . . . B. Grassmann variables anticommute and obey the 
standard differentiation and integration rules 



0, (66) 

dip pip pip . . . ll>a„ =1pai--- V'a„ ■ 



To every state t = [ri/j] we associate a basis element of the 
Grassmann algebra, t ^ gr, which is a product of factors 
ipp according to the following rule: if within the sequence 
[np] one has for a given a the value tt-q = 0, we take a 
factor ipa in gr, while = 1 corresponds to a factor 1. 
The order of the factors ijja corresponds to the order in 
the bit sequence. Thus an ordered chain {ni,n2, . . . ^ns) 
corresponds to an ordered product of factors ipa for each 
Uq, = in the chain. An example for the map from the bit 
chain to the Grassmann algebra r — > g^- is 



(1,0,0,1,0,. 



(67) 



Since the order of the Grassmann variables in a product 
matters, we have to fix for this purpose some arbitrary 
ordering of the bits such that they are labeled in order 
by a = (l,2,...i?). The sign convention for the basis 
elements gr has a plus sign if all factors ipa are ordered such 
that smaller a are to the left. The Grassmann elements 
g-r form a complete basis in the sense that an arbitrary 
element g of the real Grassmann algebra can be written 
as g — ^T-gr 1 At G R. In other words, an element of 
the Grassmann algebra can be specified by the set of real 
numbers {At}. 

As a next step, we associate to an arbitrary probabil- 
ity distribution {pr} an element of the Grassmann algebra 
g (z G- Since this association holds only up to a sign it 
is convenient to use directly the isomorphism between the 
real classical wave functions {qr} and the elements g of a 
real Grassmann algebra. 
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9r 



Pt = Qr- 



(68) 



This defines a map G ^ V , .g — > {Pt} realized by g O 
{Qt},{Qt} ^ {Pt}- The map is not invertible since the 



components Qr are fixed by Pr only up to a sign. Only 
for a given sign convention qr = Sr^/p^, Sr = ±1, one can 
define an inverse map V G- 

We also define the conjugate element 5 of g by 
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^qrgr, 



(69) 



where gr obtains from r = [np] by taking a factor ipa for 
every Ua = 1, and a factor 1 for Ua — 0, with an appropri- 
ate sign such that grffr = V'iV'2 ■ • • V'b- More formally, we 
may define gr by the property 



(70) 



where the functional integral over all Grassmann variables 
reads 



ViP 



dipi 



dip2 / dipi 



(71) 



The elements g and g associated to the probability distri- 
bution {pr} arc normalized according to 

j T^'fp 99 = J X! IrgrQr'gr' "^^Pr = (72) 
r ' ,r T 

2. Grassmann operators 

We can realize a classical observable A as an operator A 
acting on g according to 

Agr = Argr , / ViPgAg - ^Pr^^ = (A). (73) 

In particular, the occupation number operator Afa associ- 
ated to Na reads 



d 

Afa = dalpa = TT— V'a- 
dlpa 



(74) 



Different occupation numbers commute, AfaAfp — AfpAfa- 
The equal-time correlation functions (|18l) can be computed 
as 



Cl^l.c^ - / V4,~9^a,...Afa„^9. (75) 



The bit operators find now a simple expression. For ex- 
ample, the annihilation operator for bit one, which is rep- 
resented by eq. on the level of the states r, reads 
now 



Si^) = Vi , Si'^ = 5i = 7^- (76) 
dipi 

We can use standard annihilation and creation operators 
for fermions 

aa=1pa , ai ^ da , Afa = aloa, 

{aa, ap} = {ai, a|j} = , {al, ap} = 6ap. (77) 



d 
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(We use here the standard notation a' for the creation op- 
erators, even though the notion of hermitean conjugation 
has not been defined for the Grassmann algebra at this 
stage.) The operators aQ,ajj correspond to the bit opera- 
tors B'f\B'f ^ up to a possible minus sign which reflects 
the position of Ua in the chain of bits for a given element 
9t- 

3. Grassmann valued wave function 

The element g = ^r^r plays the role of a Grassmann- 
valued wave function, similar to a vector which obtains 
as a sum over basis elements g-r with coefficients q.,.. The 
probabilities Pr obtain from g by using a projector Vt with 
the property 

'Prg = Qrgr , Pt = J VtpgVrg, (78) 

T 

l^r'^p — ^T^rp : "^rgp — gr^rp- 

Here Ig is the unit operator in the Grassmann algebra, 
Igg = g. If we describe g by the set of coefhcients {qr} one 
hiis Vp{qr} = {0,0,...0,gp,0...0}. 

For an arbitrary Grassmann operator J- one has 

f = ^9 = Y1 ^-"Sr , ^9p = X! ^^p9r- (79) 

r T 

For linear operators this yields 

^9 = ^^ QpQp X! 'ip^dp = X! ^rpqp9T , (80) 

P P T.P 

and we infer 

f, = J2Frpqp. (81) 

p 

If we consider the coefficients {qp} as real vectors, the linear 
Grassmann operators are represented as real matrices F^p. 

In this picture the projector Va- is represented by the 
diagonal matrix {Vc)tp, 

V, ^ {Va)rp - ^ar^ap- (82) 

For an arbitrary Grassmann operator one has 

VrTVr ^ FrrVr- (83) 

The explicit form of Vr can be written as 

Vr-=^ 9rVrVr~gr, (84) 

where gndr stand for the multiplication with the corre- 
sponding Grassmann elements. The operators P^.,!?^ in- 
volve products of derivatives such that applied on g^ and 
gr they yield 

Vrgr 1 , = 1- (85) 

Thus the projector Vt provides an explicit formal construc- 
tion for the extraction of the probability distribution {pr} 
from the Grassmann wave function g. 



At this stage we have expressed all statistical proper- 
ties of our system of two-level observables in terms of a 
Grassmann algebra. We can associate to every normalized 
element of a Grassmann algebra (^^ = 1) a set of clas- 
sical probabilities Pr — 9^, and to every "real diagonal" 
Grassmann operator A a classical observable Ar with ex- 
pectation value given by eq. (|73p . The time evolution of 
the probability distribution typically involves Grassmann 
operators J- ((79| that are not diagonal. 

4. Complex Grassmann algebra 

The Grassmann algebra can be extended to a complex 
Grassmann algebra, where the coefficients qr are replaced 
by complex numbers Ct, such that the probability distri- 
bution {pt} is represented by 

r r 

As discussed before, we may use the last bit for the dis- 
tinction between the real and imaginary parts of a complex 
wave function, with a = {k,B) and r' labeling the 2^^^ 
states corresponding to the bits labeled by k, 

qT'.+ = q(^[nk], HB = ij , qr'.- ^ q(^[nk],nB ^ Oj, 

Ct' = qT',+ + iqr'-- (87) 

We may now construct a complex Grassmann algebra us- 
ing Grassmann variables V'fe (no "05 !) and basis elements 
gr', with Pr' = \cr'\^ and g = Cr'gr'^g c*>9r'- In princi- 
ple, a Grassmann algebra based on i? — 1 Grassmann vari- 
ables with complex coefficients Cr' contains the information 
about B bits. However, the probabilities pr' correspond to 
a type of "coarse graining" since they add the probabilities 
of states with hb = 1 or ub = 0,pr' — q'r^ ^ + qr' _■ 

Correspondingly, only the occupation numbers A4 are 
represented by eq. (|74)) , whereas A/b projects on the 
real part of the Grassmann algebra, AfBiJ2r '^■^9'^) ~ 
J2r Rc{cr)gr- (Notc that at this stage the notion of "real" 
or "complex" is not defined for the Grassmann variables ipk 
or basis elements gr'-) The multiplication of a Grassmann 
element by the imaginary unit i corresponds to a bit opera- 
tor acting on ub, namely qijiB = 1) ^ q{nB = 0),q{nB = 
0) — —q{nB = 1), or a switch between tib = 1 and ub = 0, 
with a subsequent change of sign for the states q{nB = 1)- 
We emphasize that the association (1571) defines a particu- 
lar complex structure. Many other complex structures are 
possible for the wave function and the Grassmann algebra. 

VII. FERMIONS 

In this short section we recall how each element of the 
Grassmann algebra can be associated with the quantum 
wave function for a multi-fermion system. In turn, this 
implies that the local probabilities {prit)} can be used in 
order to describe fermionic quantum systems. Of course, 
this is only one of the possible interpretations. The for- 
mulation in terms of Grassmann elements is particularly 
suitable for a system of fcrmions. 
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A pure state quantum wave function for a multi-fermion 
system with F different fermion species can be written in 
the occupation number basis as 

ip = J2cr0r , ^|C,|' = 1, (88) 

r r 

with basis states 

^1 = |l,l,l...l) , ^2 = |0,l,l,...l), 

^3 = |l,0,l...l),... (89) 

Here the index r labels the 2^ basis states with fixed oc- 
cupation numbers and Ct are complex coefficients. We can 
associate the basis states (pr with the basis elements gr of 
the Grassmann algebra 

51 = 1 , .92 = -01 , 53 = -02, • ■ • (90) 

For this purpose we work with a complex Grassmann al- 
gebra, with F = B — 1, and the last bit labeling real and 
imaginary parts of the wave function. The wave function ip 
([55)1 is then associated with an element of the Grassmann 
algebra or Grassmann wave function 

V = 9 = ^Crgr- (91) 

T 

The annihilation and creation operators Oq, , aj^ for 
fermion species a are represented as Grassmann operators 
according to eq. ([77|) . The basis states can be obtained 
by applying products of Oq, on the totally occupied state 
(fii. (We choose here sign conventions for the basis states 
such that the sign is positive if the are ordered such 
that smaller a are to the left. This corresponds to a con- 
vention for the Qr with positive sign if the factors ipa are 
ordered with the smaller a on the left. We may call this the 
"hole convention" since the representation of states with a 
certain number of zeros in the bit chain is particularly sim- 
ple. In sect. IXIVI we will encounter a different "fermion 
convention".) In this way all operators acting on Grass- 
mann elements are directly related to operators acting on 
the multi-fermion wave function. 

In particular, we can write the unitary evolution law in 
the form of a Grassmann evolution operator L{{t,to) acting 
on an initial state 5(^0) 

g{t)^U{t,tn)g{to). (92) 

Properties of this evolution operator are discussed in the 
appendix. In the Grassmann formulation the Hamilton 
operator "H is defined as 

H{t) = idfU{t,to)Uit,to), (93) 

with U the hermitean conjugate of U (see appendix). The 
time evolution (j92p can then be written as a Schrodinger 
equation 

idtgit) = idtU{t,ta)g{to) 

- nit)Uit,to)gita)^nit)git). (94) 



VIII. TWO-STATE QUANTUM MECHANICS 
FROM UNITARY EVOLUTION OF CLASSICAL 
PROBABILITIES 

So far our discussion has remained rather abstract and 
general. In the following sections we will discuss a few 
simple examples. We start in this section by presenting an 
explicit unitary evolution law for classical probabilities that 
realizes the Schrodinger equation for two-state quantum 
mechanics. 

1. Unitary evolution 

We take a system with only one value of a such that 
the probability distribution {pr(i)} reduces at a given t 
to the two probabilities {pnit) , Pi{t)} for the states with 
N{t) = 0, 1, with po{t) +pi{t) = 1. The state of the local- 
time subsystem is specified by the sequence {pi{t)}. We 
employ here a language with continuous time but discrete 
time steps ti can be easily implemented. We start with the 
equivalence class of all-time probabilities {pi^} that lead to 
the sequence of local-time probabilities pi{t) = cos^ ujt in 
eq. ([55]) , for which we have constructed some explicit repre- 
sentative all-time probability distributions in sect. IIIII The 
local-time probability distribution obeys the evolution law 
([37|). which belongs to the unitary evolution laws of the 
type ([55]) , (|57|) . Other sequences of local-time probabilities 
that obey the same evolution law are given by 

pi{t) = cos^{ujt)pi{0) + sm^{ujt)po{0) 

-2s cos(cjt) sm{ujt) ^/pi {0)po (0) , (95) 

where we set to = and s = ±1. For po{0) — this cor- 
responds to the sequence of local-time probabilities 
while the other states of the local-time subsystem with 
Pq (0) ^ can be constructed from suitable all-time proba- 
bility distributions {pu;} in an analogous way. We observe 
that a change in the sign s is equivalent to a time reflection 
t — > —t. The mean occupation number obeys 

{N{t))^p,{t). (96) 

For an explicit specification of the evolution law the co- 
efficients Vrpcr{t,0) in eq. ([53]) read 

^111 = Vaoo = cos2(wi) , VioQ = Foil = sm^cut), (97) 
VuQ = Vioi = -Voio = -Vqoi = -scos{ujt) sin{ujt). 

In turn, the rotation matrix R in eq. (|57p can be written 
as a real unitary matrix U 

( cosujt, -sinwA / ■ +\ /noN 

^ = [ sinct, cosct j = ^ = exp{-^u^r,t). (98) 

This corresponds to two-state quantum mechanics with a 
Hamiltonian H — ujt2 and a real two-component initial 
wave function ^p{Q) = (gi(0), 90(0)) = (^pi(O), s^Jpo{Q)). 
Using the standard quantum formalism for a real two- 
component wave function one has 

^(*) ^ f ' ^*^(^) = -^Hipit) , H = uT2. (99) 
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The wave function Lp{t) = U{t,0)Lp{0) remains real in the 
course of the evohition. Evaluating the expectation value 
of the operator N = ^{1 + t^), which is associated to the 
occupation number observable, 



(Nit)) = {ipit)\N\cp{t)) = {ip{0)uHt,0)\N\U{t,0)cp{0)), 



(100) 



one recovers eq. (|96|) . 



2. Evolution of Grassmann elements 

The Grassmann algebra (at a given t) contains only two 
basis elements, {gr} = {9i,go} = {IjV'} s-^d the state is 
represented by 



git)^qi{t) + qoit)i:, 



(101) 



with 



qi{t) = cos{iot)y/pi{0) ~ sm{ujt)sy/po{0), 

qo{t) = sin(wt) Vpi(0) + cos(wt)sVpo(0), (102) 



and 



pi{t)=qi{tf , Pa{t)^qo{tf 



(103) 



We observe that a given initial sign convention for qi(0) 
and Qo(0) fixes the signs for qi{t) and qo{t) for all t. The 
sign convention at t = is arbitrary and corresponds to a 
choice of gauge. 

The Grassmann evolution operator 



U{t) = cos(ajt) + sin(a;i) ( -0 



d_ 

dip 



(104) 



= cos(ajt) + sin(a;t)(a — a^) 
describes the time evolution 

g{t)^U{t)g{0). (105) 
The conjugate basis elements read {gr} = {^i 1}: with 

g{t) = qo{t)+qi{t)i/j. (106) 
Correspondingly, the conjugate evolution operators read 

li'^{t) = cos{ujt) ~ sin(wi) (ip + S 



(107) 



U(t) — cos(u!t) — sin(cji) V ^ tt' > 
U{t)U{t) = 1. 



Here we use the sign operator S which reverses the sign of 
all odd elements of the Grassmann algebra, i.e. 

Sgr = (-l)*'\9r, (108) 
with Mr the number of factors of ip in g^-. 



Using ^V' ~ = — 1 we can write 



u{t) = exp <; w ( ^ - — u 



U{t) = exp\-Lj{iJj- —]t 



(109) 



This can be cast into a Hamiltonian form similar to quan- 
tum mechanics, 

idtU = nU , U = cxp{-im), 

n ^ iuj[%P- ^iuj{a-a^), (110) 



with 



0,0 
1,0 



0,1 
0,0 



(111) 



This realizes the Schrodinger equation idtg = Hg. 

3. Time evolution of occupation number 

In the Grassmann formulation the occupation number is 
represented by the Grassmann operator 



dtp 



(112) 



It translates to the quantum operator TV — (l-|-r3)/2, with 



d#(t)— V'ff(i) 



(113) 



Motivated by the quantum analogue we evaluate further 
Grassmann operators as ^ = V' + ^ , 

{A{t)) = J d^g{t) + ^) g{t) = '2qo{t)qi{t) 

= 2sin(a;t)cos(wt)(2pi^o - 1) (114) 
+2s(cos^(a;t) - sin^(a;t)) ^pi.o(l - pi,o)- 

A priori, the classical statistical interpretation of {A{t)) 
may not seem obvious. A quick calculation reveals, how- 
ever, that {A) coincides with the time derivative of eq. (|95p 
up to a factor — w. 



dt{N) =-Lj{A). 



(115) 



This is not surprising in the quantum picture. The quan- 
tum operator associated to ^ is ti, and we may compute 
dt{N) in the Heisenberg picture, [H,N] = iujTi, or 

dt{N)=t{^{t)[H,NMt))=^u:{^{t)\T,\^{t)). (116) 

This simple observation has interesting implications. We 
have started with a simple classical statistical ensemble, 
say with a probability distribution (f^ . ([5^ or the gener- 
alization to a finer grid discussed in sect. |lll The local (in 
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time) probabilities po{t),pi(t) show the periodic behavior 
([55]) . This classical statistical ensemble is characterized by 
the sequences of local-time probabilities (|95|) which obey 
a simple unitary evolution law. Despite this completely 
classical statistical structure, we have found a quantum 
description with non-commuting observables 

N ^^{l + Ts) , dtN ^-L^n. (117) 

We will discuss this issue in the next section. 



IX. UNEQUAL TIME CORRELATION 
FUNCTIONS AND NON-COMMUTING 
QUANTUM OPERATORS 

We next address the question which properties follow 
purely from the evolution law of the classical probabilities, 
and which ones need additional input from the all-time 
probability distribution. This will concern unequal time 
correlations of the type (O and the status of the observ- 
ables as dtN{t). 

1. Time-derivative observables 

The status of the observables N and dtN is different. 
Going back to the formulation with discrete time steps it 
is obvious that {N{ti)) can be directly computed from the 
probability pi{ti), where the past and future play no role 
and can be integrated out according to eq. (IHl) . In con- 
trast, the expectation value of the observable 



{dtN),{U) = 



N{U) ~ N{t, - e) 



(118) 



requires, in principle, more information than contained in 
Pi{ti). It involves the joint probabilities of two "time lay- 
ers" ti and U — e, corresponding to extended local proba- 
bilities for an interval with two time points in the sense of 
sect, mil One needs the four probabilities {pii{ti,ti — e), 
Pw{ti,k - e), Pm{ti,ti - e), pao{ti,U - e)} for finding N{ti) 
and N{t.i — e) in the combinations (1, 1), (1, 0), (0, 1) and 
(0,0), respectively. (The sum of the four joint probabili- 
ties equals one.) They can be obtained from eq. ^U\\ . now 
integrating out only the future {t' > ti) and the past before 
U -e{t' <U~ e). 

In terms of the joint probabilities one finds {dtN)f{ti) 
and the classical correlation between N{ti) and {dtN)^{ti) 
as 

{{dtNUU)) - i(pio-poi), (119) 
{NiU){dtN),iU)) = ipio. (120) 

Two out of three independent joint probabilities can be 
computed from pi(ii) and pi{ti — e), 



(Nik)) 
{N{U - e)) 



Pii 
Pii 



-Pio = PiiU 
Poi = Pi {ti 



(121) 



This allows us to compute piQ — poi f^nd therefore 
{{dtN)g{ti)) from pi{ti) and pi{ti — e), in contrast to pio 



and the correlation {N{dtN)f} . The expectation value of 
{dtN)f^ can therefore be computed from the information 
contained in the state of the local-time subsystem, while 
{N{dtN)^) needs information beyond the local-time sub- 
system. 

The evolution law relates pi(ii — e) to pi(ti) 



p, (7 = 0,1 

= X! Rip{ti,ti - e)Ria{ti, U - e)qp{ti - e)q^{U - e) 



cos^{uje)pi{ti — e) + sin^(cje)po(ii — e) 
— 2cos(a;e) sin(a;e)gi(ii — e)qo{ti — e). 



(122) 



and we infer 



{{dtN)^{U)) = — cos(a;e) sin(we) 



\Jpi{ti - e)(l -pi(tj - e))sign((gigfo)(ii - e)) 
+ -sm^{ue){l'-2pi{t,~e)). 



(123) 



In the limit e — > only the first term contributes, yielding 



{dtN{t)) 
s{t) 



~2Lo^p,{t){l-piit))s{t), 
sign(gi(t)go(t)). (124) 



By virtue of the evolution law (dtN) is almost computable 
in terms of the probability pi (t) - the only additional infor- 
mation which keeps some additional memory of the states 
of the system at earlier times is its sign s{t). The wave 
function ((71,90) precisely contains the sign information, 
beyond the probabilities related to |qi|,|<Zo|- (An overall 
common sign of qq and qi is unobservable since it cancels 
out in the computation of observables.) 

We may interpret the pair (pi{t), s(t)) as the information 
characterizing the state of a statistical "fixed-time subsys- 
tem" related to a given time t. Information contained in 
the full ensemble for all U - i.e. in the sequence of all local- 
time probabilities {pi{ti)] - which goes beyond {pi{t), s{t)) 
is not relevant for expectation values of fixed-time system 
observables. The information (^pi{t) , s{t)) which character- 
izes the state of the fixed-time subsystem is conveniently 
collected in the wave function qr{t),T — 0, 1. Apart from 
the irrelevant overall sign the information about the wave 
function is indeed contained in pi{t) and s{t). Similar to 
quantum mechanics it is nevertheless convenient to specify 
the fixed-time subsystem with the two-component normal- 
ized wave function qrit). This contains the overall sign 
as redundant information, similarly to the redundant over- 
all phase in quantum mechanics. For any initial condition 
Irita) the fixed-time subsystem is completely determined 
by the evolution law. In terms of the wave function eq. 
(|124p finds the simple expression 



{dtN{t)) = -2ujqi{t)qo{t). 



(125) 



2. Ambiguities for time-derivative observables 
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Even though (dtN) can be computed from the wave 
function {qrit)} it is not a system observable in the stan- 
dard sense. The expectation value of the squared classical 
observable (9t-/V(i)) cannot be computed unambiguously 
from s(t)). This becomes apparent if we go back to 

discrete time steps and consider (dtN)^. The spectrum of 
the possible values of {dtN)^ in a classical state u — {n{ti)} 
contains three values {—-,0, i). Thus {dtN)1 has eigenval- 
ues 1/e^ and and the expectation value 



{{dtN)l 



-Poi) 



(126) 



diverges for e ^ (unless pio + poi '■^ e^, which is only 
possible for {dtN) = 0). 

Furthermore, there are several different discrete observ- 
ables which yield the same continuum limit for dtN. For 
example, dtN can be associated with {dtN)~ as given by 
eq. (|118p . but also with 



(dtN)t = 



+ _ N{U + e)-N{U) 



(127) 



or 



(dtN), 



N{t, + e) - NjU - e) 
2e 

l{(dtN)t + {dtN):). (128) 



The expectation value of (dtN)'^ obeys the same formula 
as for {dtN)~ , except that the wave function is now taken 
at < -|- e instead of t. This difference vanishes in the contin- 
uum limit such that the expectation values of the discrete 
observables (jllSp . (|127p and (|128p have a common contin- 
uum limit. However, we can now also consider products 



{dtN)t{dtN)- = -{NiU)N{U + e) + N{U)N{U - e) 
~NiU + e)N{U - e) - N{U)}. (129) 

The expectation value of this observables involves joint 
probabilities at three time layers ti + £,ti and ti — e. Using 
the relations 

{N{U)) = Pill +P110 + Poll +P010, 
{N{ti)N{ti + e)) = piii+puo, 
{N{ti)NiU-e)) = Pill + mi, 
{N{U + e)N{U-e)) -pm+pioi, (130) 

one obtains a negative expectation value 

{{dtN)t{dtN):) = -4(Pioi +Poio), (131) 

in contrast to the positive expectation values 
2 1 

([(5fA^)+] ) = ^(pioi +P100 +P011 +Poio) 
l\{dtN)-^) = ^(piio+Poio+Pioi +Pooi). (132) 



The discrete observables (|131l) and (|132[) take different val- 
ues for e — >■ and there is no unambiguous well defined 
continuum limit for {{dtN)"^). 

While the continuum limit of the expectation value of 
the observable dtN is well defined, the spectrum of the dis- 
crete observables {dtN)f and {dtN)2e is different - for the 
last observable the possible values for the classical states 
are {dtN)2e = ±l/2e, 0. This lack of uniqueness of the 
spectrum is in close correspondence with the lack of a well 
defined continuum limit for {dtN)^. If one would like to 
perform measurements of dtN or (dtN)^ the lack of unique- 
ness of the discrete observables would be very annoying. 
The outcome of a series of measurements in identical set- 
tings would depend crucially on the precise details of the 
measurement apparatus which would need a time resolu- 
tion of the order e and a precise statement which particular 
discrete observable is measured by a particular apparatus. 
These are typically not properties of a realistic appara- 
tus, and one would like to have a more robust definition 
of observables that can be measured. That this should be 
possible is suggested by the existence of a unique limit for 
(dtN). 

3. Non-commuting quantum operators 

The lack of robustness of observables containing time 
derivatives has already been noted in ref. [l^ . There it 
was proposed that more robust observables can be based 
on equivalence classes that correspond to the operators in 
quantum mechanics. Indeed, we have found in eq. (I117P 
the relation 



{dtN) = -w(ti) = -2ujqi{t)qa{t). 



(133) 



This equals precisely the expression (jl24l) . (We can iden- 
tify in leading order t—e and t.) The information contained 
in the wave function q{t) is sufficient to compute the ex- 
pectation values of off-diagonal operators as ti in addition 
to the diagonal ones. 

Observables as dtN can be represented as commuting 
"diagonal observables" for the all-time statistical ensemble 
for all states {n(t)}. While the spectrum depends on the 
details of the particular definition, the expectation value 
is robust in the sense that the detailed implementation 
does not matter. The robust expectation value can also 
be computed from off-diagonal quantum operators for the 
subsystem at a given time. In accordance with the gen- 
eral considerations how a quantum system can arise from a 
classical statistical ensemble the joint probabilities for two 
observables expressed by non-commuting operators are not 
available within the subsystem [l^. In our case this con- 
cerns the classical correlation {N ■ dtN) which cannot be 
computed from the information in the local-time subsys- 
tem. 

The robustness of the expectation value raises the in- 
teresting question if one can find a classical observable for 
dfN that becomes a system observable in the continuum 
limit. Such an observable must be a two- level observable 
with possible measurement values uj and — w. We leave this 
for future investigation. 
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4. Unequal time occupation numbers 

While the issue of a useful classical observable for dtN 
is somewhat involved, there are other much simpler cir- 
cumstances where off-diagonal quantum operators appear 
for the local subsystem at a given time t. The classical 
observable N{t + a) is well defined, with spectrum (1,0). 
For a 7^ it may be called somewhat loosely an "unequal 
time occupation number" , where unequal relates to dif- 
ferent time arguments for the occupation number and the 
local-time probabilities. The expectation value {N{t + a)) 
is computable from the information available for the local 
subsystem at time t, characterized by qr(i) or ^p{t). One 
has the identity 

{N{t + a)) = ql{t + a) (134) 

with, cf. eq. dMl), 

^ / cosa;a , - sinu^a ^ + (135) 

^ ' \ smwa , coswa J ^ ' ^ ' 

We can therefore express the expectation value of A^(t-t-Q;) 
by the quantum operator N{t -f a), 

(iV(i + a)) = (^^(t)7V(t + a)V3(i), 

N[t + a) = ^R^{a){l+T3)R{a) (136) 

= ^U\t + a,t){l + T3)U{t + a,t). 

This is precisely the Heisenberg picture of quantum me- 
chanics, cf. eq. (|100l) . Since N{t -I- a) is a classical two- 
level observable and its expectation value is computable 
from the state of the local-time subsystem it is a system 
observable. 

The interpretation of measurements in quantum mechan- 
ics can now be inferred from the basic postulates of classi- 
cal statistical mechanics. In the all-time classical statistical 
ensemble N{t + a) is a standard classical observable, and 
the only possible outcomes of measurements are zero and 
one. This corresponds to the spectrum of eigenvalues of the 
quantum operator N{t + a), explaining the quantum rule 
that only eigenvalues of the quantum operator A associated 
to an observable will be found in measurements. The quan- 
tum rule for the expectation value (jl36|) has been obtained 
from the classical rule for expectation values. The proba- 
bilistic interpretation of quantum mechanics states that an 
eigenvalue X„i will be found with a probability p,„. This 
probability obtains by choosing a basis |to) of eigenvalues 
of A, A\m,) = Amlm), as 

p,„ = |((^|m)|2. (137) 

(We assume here non-degenerate eigenvalues for simplic- 
ity.) For the operator N{t -\- a) these are exactly the prob- 
abilities in the classical statistical ensemble to find the val- 
ues Ai = 1, A2 = of the observable N{t -f- a), namely 

Wi ~ pi{t + a) = qj{t + a) 

= {Mt)R,Aa))'^\{^m)\\ (138) 



with 




(139) 



and W2 = I — wi. We conclude that non-commuting op- 
erators and their standard association to observables and 
measurements in quantum mechanics arise in a natural 
way from our classical statistical ensemble if one focuses 
on local-time subsystems. 

The explicit operator representing the occupation num- 
ber N{t + a) in the local-time system with state character- 
ized by (p{t) reads 

, / cos^wa , — sinwacoswa \ 
N{t + a)=( . ' .2 

y — sm cos OJQ! , sm uja J 

11/2 ■ 2 ^ 

= - + -(cos wa — sm wajrs — smwacos wari. 

(140) 

In particular, one finds 

iV 4(1 -n). (141) 

This yields for the expectation value of dfN the relation 

(Wt))=2^(^(7v(i+£))-0. (142) 

This relation holds independently of the particular defini- 
tion of the observable dtN, while the observables dtN{t) 
and N (t + ^) are not necessarily identical. 

X. MEASUREMENTS AND QUANTUM 
CORRELATION 

The classical correlation function {N{t + a) ■ N{t)) can- 
not be computed from the information of the local-time 
subsystem. This conforms with the general discussion of 
ref. [1^ and reflects the fact that the local-time subsys- 
tem is described by incomplete statistics. We can therefore 
not use the classical correlation function or joint probabil- 
ities for the description of a sequence of idealized measure- 
ments in the subsystem. Such sequences of measurements 
have to be described by a different "measurement corre- 
lation" which is based on conditional probabilities rather 
than joint probabilities 13]. The measurement correlation 
{N{t + a)N{t))m is given by the conditional probability 
to find N{t + a) ~ 1 if N{t) has been found to be equal 
to one. For an idealized measurement in a subsystem the 
conditional probability must be computable in terms of the 
available information ip{t). It should not involve additional 
information about the "environment" , as the classical cor- 
relation does. 

For a two-state system a consistent conditional proba- 
bility obeying this criterion leads to 

{Nit + a)Nit))^ = {Nit + a))^^{qiit)f. (143) 
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This can be expressed by the rule that after the first mea- 
surement N{t) — 1 the wave function ip{t) is replaced by 
the eigenstate of N{t) with eigenvalue one, and that sub- 
sequently N{t + a) is measured in this new state. This is 
the "reduction of the wave function" , which appears here 
merely as a convenient rule for specifying the conditional 
probability relevant for a sequence of idealized measure- 
ments in a subsystem. 

The rule of "reduction of the wave function" for the com- 
putation of conditional probabilities can be generalized to 
other system observables. It is consistent with the require- 
ment that two measurements of an observable A at the 
same time t (or at infinitesimally close times) should yield 
the same measurement value. We will assume in the fol- 
lowing observables with a non-degenerate spectrum. If the 
first measurement yields Am, the "reduced state" is |m). 
Then the expectation value of A in the reduced state is 
again Am, and 

{A{t)A{t))„, = \lmt)\m)\^ 

i^m^itMt)}. (144) 

More generally, a consistent conditional probability to 
find the value cr„g of an observable B if Am^ has been 
found for A reads 

w(f7„B |Am^) = |(nB|mA)P, (145) 

where A\mA) = Am^lmA) , B\nB) = (JuBlns)- With 
WniA — K'^'I^a)!^ the probability to find A^a in the first 
place, the measurement correlation becomes 

{BA)„i = ^ Cr„gA„i^w(o-„^ |Am_4)Wm_4 
mA,nB 

= ^ Am^Wm^(TOA|i3|mA). (146) 

The last expression multiplies the value found in the first 
measurement of A with the probability to find it, and then 
with the expectation value of B in the state toa, and finally 
sums over all possible measurement values of A. We can 
write the measurement correlation in the form 

{BA)m^{'p\Ci^\B,A)\ip), (147) 

" (2) 

with Cm a hermitean operator (symmetric 2x2 matrix) 
constructed from the operators A and B. In general, the 
sequence of two measurements matters 

Ci^\A,B)^Ci^\B,A). (148) 

For the special case where A and B are traceless two-level 
observables, with two non-degenerate eigenvalues (a, —a) 
or (6, —b), one can express the measurement correlation 
by the anticommutator of the operators 

C^^HbJ) = Cif{A,B) = (149) 

The anticommutator {A, B} involves the operator product. 
In this case the quantum operator product enters directly 



in the determination of measurement correlations. This op- 
erator product reflects a product structure for observables 
(isj that is consistent with the structure of equivalence 
classes of observables. Here we recall that two two-level ob- 
servables Ai and A2 belong to the same equivalence class if 
they are both system observables such that their expecta- 
tion values can be computed from the wave function of the 
local time system, and if (Ai) — {A2) for all possible wave 
functions ip{t). In this case Ai and A2 are represented by 
the same quantum operator A. (An example for our simple 
two-state system are dtN{t)/uj and 2N (t + ^) — 1, which 
are both represented by the operator — ri.) 

For more general observables with a non-degenerate 

" (2) 

spectrum eq. (|149p does not hold and Cm becomes a 
non-linear expression of A and B. (One can show that 
no alternative consistent definition of conditional probabil- 
ities is possible which would lead to {N{t + a)N{t))m = 

{(p{N{t + a), N{t)^ip) /2.) In a basis of eigenstates of A the 

" (2) 

operator Cm {B,A) is diagonal 

C^^\I3,A) = Y,\'^A)cBA{mA){mA\, 

rriA 

CBAirriA) = ^\{nB\mA)\'^<JnBKiA- (150) 

llB 

The diagonal elements cba can be computed from A and B 
independently of the particular representation of these op- 
erators. On the other hand, the inverted sequence typically 
also involves off-diagonal elements in this basis, 

C'^niAB) = ^ \mA)cAB{mA,mc){mc\, 

mA,mc 

CAB{mA,rnc) = (151) 

^ {uiAlnB) (riB |mc) {ub |™A> l^f^ns ^m'^ ■ 
rtB ,m'^ 

One may use projection operators on eigenvectors of A 
or B 

PmA = \mA){mA\ , PnB = 

A = ^m^Pm^ , P = ^a„^P„,, (152) 

ruA nB 

and write 

(P^)m — ^ ^ ^mA^nB^rnAP^BP^iA^ 
mA,nB 

{AB)m = E ^ 

(153) 

mA,nB 

Since all quantities a„B,XmA^ and WmA that ap- 

pear in eq. (jl46p can be extracted from the quantum 
operators A and B we can construct {BA)m in terms of 
these operators. This measurement correlation therefore 
only involves the equivalence classes of observables as it 
should be. For the special case of two-level observables 
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XmA = ±A,cr„^ = ±cr one recovers eq. (|149l) . while eq. 
(|143p can be expressed as 

(Ar(t + a)A^(0)m = (^(i)^(i + (154) 

The expressions (|153p do not depend on the choice of basis 
for the representation of the operators. 

The use of measurement correlations that differ from 
the classical correlations is mandatory for idealized mea- 
surements of a system that is characterized by incomplete 
statistics, as the local-time subsystem in our case. This 
is also the way how "no-go theorems" based on Bell's in- 
equalities [17] for a representation of quantum mechanics 
by a classical statistical ensemble are circumvented. In- 
deed, a measurement correlation based on joint probabili- 
ties instead of conditional probabilities would imply Bell's 
inequalities [T8| . The experimental verification of a viola- 
tion of Bell's inequalities can therefore be used as evidence 
that only the information for the local-time subsystem is 
available or used for ideal measurements, rather than the 
all-time probability distribution. We emphasize that cor- 
relation functions that differ from the classical correlation 
function do not only appear in quantum mechanics. They 
are genuinely needed for a description of measurements in 
classical statistical systems as well [31 . It is remarkable 
that already our simplest example of a periodic all-time 
probability distribution (1331) shows almost all characteris- 
tic features of quantum mechanics. 

XI. QUANTUM WAVE FUNCTION FROM 
CLASSICAL STATISTICS 

We have seen several features of quantum mechanics 
emerging from our attempt to formulate classical statis- 
tical ensembles that admit an evolution law that can ac- 
commodate for a periodic time evolution. It may be useful 
to summarize what we have achieved so far: 

1. We associate to the unitary evolution law for a clas- 
sical probability distribution a quantum mechanical 
Schrodinger equation. 

2. For Ising-type classical statistical ensembles with a 
unitary time evolution we have formulated a repre- 
sentation in terms of Grassmann variables. This al- 
lows a simple representation of fermions and bit op- 
erations. 

3. We have found that system observables for the local- 
time subsystem can be represented by quantum op- 
erators. These operators show a non-commutative 
product structure. We have advocated that the cor- 
relation of idealized measurements within the local- 
time subsystem is based on the non-commutative op- 
erator product. 

At this state it may be useful to ask about the status of 
the quantum mechanical wave function within our classical 
statistical setting. 



1. Maps between v^^ave function and probabilities 

The association between a Schrodinger equation and the 
time evolution of a classical statistical ensemble can be dis- 
cussed independently of the Grassmann formulation (cf. 
sect. lIVp . This also holds for the wave function. The 
role of the quantum wave function is best characterized 
by mapping the wave function ipit) to a local-time prob- 
ability distribution {prit)} on one side, and by a map 
from the all-time probability distributions {pu;} to quan- 
tum wave functions on the other side. We start with the 
map tp{t) {Pr{t)}- 

Consider first an arbitrary real quantum mechanical 
wave function ij){T]t) with r a set of continuous or dis- 
crete variables as position and spin. For any given time t 
this defines a classical statistical ensemble with probability 
distribution 

Pr{t)=p(T-t)= (155) 

A Schrodinger equation for the time evolution of ij: induces 
a time evolution law for the classical probabilities Prit)- It 
is of the unitary type (US]), ([58)) . 

The restriction to real wave functions entails no loss 
of generality. Indeed, we can always choose a real rep- 
resentation of the quantum wave function. A complex 
function iplr'-^t) can be decomposed into real and imag- 
inary parts ■^'{t' ,t) = il)fi{T',t) + iipiir' ,t). Interpreting 
77 — {R, I) as a discrete index with two values, we label 
T — (r', 77) and consider the real function ^P{t] t) defined by 
i{j{T',R;t) = ^fi{t' ,t),i}j{T' ,I;t) = i}ji{T',t). The unitary 
evolution of iP{t' ] t) described by the Schrodinger equation 
turns then to a rotation for the evolution of the real wave 
function ^{T]t) = qrit) according to eq. ([57)) . 

The opposite direction, namely the association of a quan- 
tum wave function qrit) to a given time evolution of a 
classical probability distribution p,-(t), requires some addi- 
tional thought since qr [t) is fixed by Pr {t) only up to a sign 

Sr{t) 

q,{t) ^ Sr{t)^M^- (156) 

The choice of {sr{t)} can be largely fixed if we require that 
the time evolution of {qrit)} is given by a continuous and 
arbitrarily often differentiable rotation. Indeed, if <?,- (t) de- 
pends continuously on time for a given choice of Sr (t) , any 
other choice of Sr(t) would lead to a discontinuous jump 
of q-r whenever 7^ 0. For continuous qrit) the sign Sr{t) 
must be constant for all t where qrit) 7^ 0. However, Srit) 
may change sign for the zeros of qr ■ This typically happens 
if a continuous rotation implies a simple zero qrit) — 0, 
with sign(^qT{t + e)) — —sign{qr{t— tj) . More complicated 
cases can be discussed in a similar spirit. These arguments 
fix {sr{ty} for every choice of the initial values {sr{t = 0)}. 
Any other choice would imply a discontinuity in the deriva- 
tives of the parameters of the 0(A/')-rotations at some time 
t. Such discrete jumps are excluded if we require continu- 
ity and arbitrary differentiability of the time evolution for 

qr{t). 

The "history of signs" Sr{t) stores information that is 
available for the sequence of local-time probability distri- 
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butions {p-rit)}, but not for an individual probability distri- 
bution at fixed t. These signs matter for the computation of 
expectation values of off-diagonal operators. The sequence 
of local-time probability distributions is computable from 
the all-time probability distribution {puj}- We therefore 
have a map Vi — > S, where S denotes the local-time sub- 
system with a given evolution law, as characterized by the 
possible sequences of local probability distributions {prit)} 
that obey the evolution law. (Here Pi denotes the ensemble 
of all-time probability distributions for which the evolution 
law is valid.) For a unique construction of a wave function 
from classical probabilities we still need a map 5 — >■ Q, 
with Q the space of quantum states, parametrized here by 
the real wave functions {qrit)}. This association is still not 
unique, since the signs of the wave function {qrit = 0)} are 
not fixed uniquely. Only once the signs s-rit = 0) are fixed 
the signs for all other times Sr{t) are computable by the 
evolution law. 

If the index r contains a continuous part, as a posi- 
tion variable a; or a momentum variable p, the choice of 
Sr(t = 0) may be further restricted by requesting continu- 
ity and differentiability of Qrit) in x or p. One typically 
ends with a freedom of choice of a few signs for qT-{t — 0), 
and we will pick a fixed one as in eq. (|102[) for fixed s. This 
choice may be associated with a choice of gauge which does 
not matter for the expectation values of observables. For 
such a suitable fixed choice we can then map a given time 
evolution of classical probabilities {prit)} to a real quan- 
tum mechanical wave function {qr{t)}. This completes the 
map V ~> Q, which amounts to a construction of a quan- 
tum state in terms of classical probabilities. 

In turn, this wave function may find a complex represen- 
tation ipT-' {t) if the time evolution is compatible with the 
complex structures, i.e. if the 0(A/')-rotations belong to 
the unitary subgroup U{Af/2). Furthermore, the relevant 
observables of the problem should be compatible with the 
complex structure. In the next section we give an explicit 
example how such a complex structure arises from classical 
probabilities. 

Most familiar quantum systems are described by a com- 
plex wave function tp^i. We emphasize that the classical 
probabilities are not the absolute square IV'r'P- Specify- 
ing only the absolute squares discards part of the infor- 
mation contained in the classical probabilities Pr- Since 
\ipT'\'^ = ipir' , R)^ + iplr' , I)'^ the specification of IV'r' P can 
no longer resolve between p{t' , R) and p{t\ I) and there- 
fore looses phase information. 

It is often stated that the characteristics of quantum 
physics is encoded in the "physics of phases" that are 
needed beyond the probabilities. In our approach, most of 
the phase information is again associated to probabilities. 
The components of the real wave function qr contain the 
information for probabilities Pr- These are twice as many 
real numbers as IV'rP and therefore account for most of the 
phase information. What is missing beyond the probabili- 
ties Pr are only the signs Sr and one could phrase that the 
quantum state involves the "physics of signs" beyond the 
probabilities. These signs are needed for the computation 
of expectation values of off-diagonal operators. We have 



seen that part of the sign information is indeed coming 
from information about the sequence of local-time proba- 
bility distributions beyond a given time. Another part may 
be fixed by continuity, and the remainder corresponds to a 
gauge choice without impact on observations. 

2. Fixed time statistical ensemble and quantum 
state 

At this point it is useful to introduce the notion of the 
"fixed time statistical ensemble" . It is specified by the real 
wave function {qrit)} for a fixed time t. This contains 
somewhat more information than a fixed time classical 
statistical ensemble which would be characterized by the 
probability distribution {^^-(i)}, namely the signs {srit)}. 
These signs permit the computation of expectations values 
of real off-diagonal observables, besides the diagonal ones 
which only involve {prit)}. 

If we specify the states of the local-time subsystem by 
the evolution law and the wave function {qrito)}, the in- 
formation contained in the fixed time statistical ensemble 
and the state of the local-time subsystem is the same. We 
only have made a change of point of view by focusing on a 
given time t, and taking the evolution law as a dynamical 
statement how the fixed time statistical ensemble changes 
with time. For both the state of the local-time subsystem 
and the fixed time statistical ensemble the relative sign 
between different s-r that are not determined by the time 
evolution may be partly fixed by continuity and partly by 
gauge convention. 

If a suitable complex structure exists the fixed time sta- 
tistical ensemble can be associated directly with a pure 
state of a quantum system, as specified by a complex wave 
function. We will occasionally use the naming "classical 
wave function" for the real roots of the probabilities up to 
a sign, and "quantum wave function" for the usual com- 
plex wave function. The two are related by the presence of 
a complex structure. 

XII. COMPLEX STRUCTURE AND QUANTUM 
PHASES 

A simple example for a complex structure may be in- 
structive. Let us consider a four state system {B = 2,N = 
4) with time evolution of the classical probabilities accord- 
ing to 

Pi{t) = Pi,o, 

P2{t) = cos^ {ujt)p2Q + sin"^ {ujt)p3a 
-2 cos(wt) shi{ujt)y/p2oP3o, 

P3{t) = cos^ {u>t)p3o + sin"^ {ujt)p2Q 
+2 cos{u}t) am{uJt)^/p2oP3o, 

Piit) = P40- (157) 

Here Pr.o are the initial probabilities at t — 0, normalized 
by Pio + P20 + P30 + P40 = 1- This evolution can be de- 
scribed by a differential equation for the real wave function 

{(9t)},Pt = 

dtqi = dtqi = , dtq2 = ~ujq3 , dtq3 = ujq2. (158) 
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The quantum Hamiltonian of this four state system is 
purely imaginary 



H 



/O, 0, 0, 0\ 

0, 0, -i, 

0, i, 0, 

\0, 0, 0, 0/ 



(159) 



We observe the close analogy with sect. I Villi with q2 
and (73 playing the role of qi and go in eq. ([M)) . How- 
ever, we have now a four-state system with other possi- 
ble observables. It admits a Grassmann representation 
different from sect. IVIIII A suitable Grassmann basis is 
{qt} — (1, V'l: V'2, V'iV'2): with Hamiltoniau 



n = -iu}-—i)2 + iuj——ipi 



d 



a,/3 



TT— (T2)a^?/'/3- (160) 



The time evolution (|158l) can be described in a complex 
basis 

This corresponds to a spin in a constant magnetic field. 
The complex structure is specified by the definition of q. 
Complex conjugation corresponds to an involutive map 
where the sign of q^ and qn is flipped. Multiplication by i in 
the complex basis corresponds in the real basis to the map 
qi -q-i,qi qi,q2 ^ -<?3,g3 -> <?2- In the complex 
basis the Hamiltonian is now a real hermitean operator. 

The particle numbers Afa = g:f^'0a are represented 
as linear operators acting on the real wave function 
(91, 92, 93, 94) as 



Ni = diag{l,0, 1,0) , N2 = diag{l, 1,0,0). 



(162) 



In the complex basis (jl6ip no linear representation as com- 
plex 2x2 matrices exists. These observables are not com- 
patible with the complex structure. This also holds for the 
occupation number of the second component 



ma = diag{0, 1,0,0) = iV2(l - A^i)- 



(163) 



This particular occupation number corresponds to N in 
sect. IVml replacing eq. ^ by {N{t)) = p2(0- The 
total particle number Ntot ^ Ni + N2 = diag{2, 1, 1, 0) is 
conserved {[Ntot,H] = 0), but again it is not compatible 
with the complex structure. 

An example for an observable which is compatible with 
the complex structure is 



D = diag{0, 1, 1, 0) (A^i - iVa)^ 



(164) 



It is represented in the complex basis as (1 — r3)/2. This 
two-level observable takes the value one if A^i and N2 are 
different, and zero if they are the same. This "difference 
observable" commutes with the Hamiltonian {[D,H] = 0) 
and is therefore conserved. In the complex basis the her- 
mitean operators can be written as linear combinations of 



the Pauli matrices Tk and the unit operator. In the real ba- 
sis (gi, 92, 93, 94) these operators are mapped to — f^. 



fl = 1 (g) Tl , f2 =Tl<^T3 , f 3 = Ta (8) T3, 



(165) 



while multiplication with i in the complex basis appears as 
a matrix multiplication with / in the real basis. 



/ = —iT2 (g) Tl. 



(166) 



One wonders if besides f 3 = 1 — 2D also the operators fi 
and f2 can find a physical interpretation. They correspond 
to off-diagonal observables. 



XIII. CLASSICAL AND QUANTUM PARTICLES 

Our construction of classical statistical ensembles that 
admit a unitary time evolution law can easily be extended 
to a higher number of bits B, using a purely imaginary 
hermitean Hamiltonian acting on a real wave function 
{g7-},T = 1...2-^. In the limit B ^ 00 the wave func- 
tion can depend on continuous variables. For example, we 
may consider real wave functions in phase space, q(x,p), 
with x,p continuous position and momentum variables. A 
possible Hamiltonian reads {y'{x) = dxV{x)) 



Hr 



i-d. - V'{x)dp 



(167) 



The time evolution of the classical probability density 
w{x,p) = q'^{x,p). 



dtw{x,p) 



P 



dxw{x,p) + V\x)dpw{x,p) (168) 



is the Liouville equation for the probability distribution of 
classical particles moving in a potential V{x). (In this sec- 
tion we denote probabilities by w (instead of p) in order to 
avoid confusion with momentum.) We recover the quan- 
tum formalism for a probabilistic description of a classical 
particle in terms of a classical wave function |1^ . 
A different Hamiltonian 

Hw = -i^dx + v(^+'-d^-v(^x-^-d^ (169) 

can be used for a description of a quantum particle in terms 
of a classical probability distribution in phase space [Tsj . 
From the classical wave function q{x,p) we can define a 
Wigner function [H, [2^ 



r.r .s.s 



Pw{x,p) ^ / 9 (^+2'^ + '*j ^^'^^^ 

q ^.T + — ,p + s'^ cos(sV — sr'). 



Its partial Fourier transform 



p(x,y) = J^e^P<-^-y^p^[^^y,p 



(171) 
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has all the properties of a quantum mechanical density ma- 
trix in position space. Translating the time evolution of 
q(x,p) with the Hamiltonian Hw (|169p to p{x,y) accord- 
ing to eqs. (|170l) . (|17ip yields the von-Neumann equation 



quantum wave function, is represented by an off-diagonal 
operator, obeying 



idtp = [Hq,p] , Hq = -—dl + v{x) 



(172) 



We can define quantum observables for position and mo- 
mentum that act on the wave function q{x,p) as 

XQ=x+'-dp , Pq=p- ^a, , [Xq, Pq] = ^. (173) 

The expectation values of all symmetrized products of Xq 
and Pq are precisely the ones for a quantum particle in 
a potential V(x). In particular, they can be expressed in 
terms of the Wigner function as 



{F,{Xq,Pq)) 



F{x,p)pw{x,p). 



(174) 



This demonstrates explicitly that an appropriate unitary 
evolution law for classical probabilities can describe the 
dynamics of a non-relativistic quantum particle in an arbi- 
trary potential. Arbitrary quantum states with arbitrary 
quantum density matrices can be described in this way for 
suitable initial conditions of the phase-space wave function 
q{x,p;to). This includes pure quantum states as special 
cases. For pure quantum states the wave function in phase 
space coincides with the Wigner function 



Pw{x,p) = qix,p). 



(175) 



For pure quantum states an alternative and more direct 
description of a non-relativistic quantum particle in a po- 
tential can be given along the lines of sects. El |Xll We 
use one bit for the labeling of the real and imaginary parts 
of a complex wave function tpr'it). Then we associate r' 
with X (in the position representation), instead of points in 
phase space. This yields the usual complex quantum wave 
function as ijjix, t). In this formulation the Hamilton oper- 
ator is given by Hq in eq. (|172l) . The existence of several 
pictures for the quantum particle illustrates that the time 
evolution for a given set of observables can be obtained in 
different ways. Actually, the second description can also 
be obtained from a coarse graining of the first description 

The reader should note that "diagonal" and "off- 
diagonal" observables have a different relation to the local- 
time classical probabilities. In our second implementation 
of a quantum particle the probabilities Prit) are given by 
the real and imaginary parts of the quantum wave function 
'ip{x) = i'Rix) -\-i'ipi{x) as pi{x;t) = ipj^{x;t),p2{x;t) = 
ipj{x;t). The diagonal position observable Xq is simply 
represented as x, with 

ifiXQ))^ [ f{x){p,ix)+p2{x)) (176) 

behaving as a standard observable in classical statistics. In 
contrast, the momentum operator Pq acting as —idx on the 



(Pq) = / Sl{x)s2{x){^y Pl{x)dx\/P2{x) 
J X 

-VM^d^y/M^), (177) 

where 

iPr{x) = Si{x)y/pi{x) , Iplix) = S2ix)y/p^{x). (178) 

First of all, the expression ()177|) is no longer linear in the 
classical probabilities as for a standard classical statistical 
observable. Second, it involves the product of sign func- 
tions Si{x)s2{x) = ±1. While the sign functions are, in 
principle, fixed by the time evolution and therefore com- 
putable in terms of the all-time probability distribution 
{puj} (except for a gauge freedom not relevant for physical 
observables), they nevertheless contain some sign informa- 
tion from the past/future beyond the information stored in 
the local probabilities Pi{x). Even though this information 
concerns only the sign, it is needed for the computation of 
the off-diagonal observable Pq . 

For our first formulation of a quantum particle both Xq 
and Pq in eq. (|173|) are off-diagonal observables. In this 
case the relevant part of the sign function s{x,p) is com- 
putable in terms oiw{x, p) . Thus arbitrary functions of the 
quantum operators Xq and Pq at time t are computable in 
terms of the local probability distribution alone. The ex- 
pressions are not linear in however. They involve statis- 
tical observables measuring the roughness of the probabil- 
ity distribution 15|. In practice, the computations become 
much easier if the information about s{x,p) is kept dur- 
ing the evolution as an additional ingredient rather than 
computing it from analyticity properties of w{x,p). 

We conclude that the sign functions should be regarded 
as a genuine part of the information for the computation of 
local observables. In this sense the basic object at a fixed 
time t is the wave function {qT{t)} rather than the set of 
probabilities {pT{t)} — {9t(^)}- With this philosophy we 
may consider in the preceding section spin observables in 
arbitrary directions, in particular the ones represented by 
■?i:'?2,'?3- Once off-diagonal observables are included one 
can define a new product by the multiplication of operators. 
It differs from the product of classical observables and can 
therefore account for non-commutative structures. On the 
other hand, the classical product cannot be computed from 
the local wave function, as we have seen explicitly for N 
and dtN or N{t) and N{t + a) in sect. HXl 



XIV. 



QUANTUM FIELD THEORY FOR 
FERMIONS 



Our description of fermions in sect. I VIII can be used 
directly for an implementation of a quantum field theory 
for fermions in a Hamiltonian framework. (The associated 
Grassmann functional integrals are discussed in ref. [2l|.) 
For this purpose the index a labeling the different fermion 
spaces is written as a double index a — {x, s), where x is a 
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position coordinate and s labels internal degrees of freedom 
for the fcrmions. (Alternatively we may choose a = [p, s) 
for momentum space.) In distinction to a single quantum 
particle where t = x, the space label x denotes here the 
Grassmann variables tjja and not the Grassmann element 
Qt- We work here with a complex Grassmann algebra. 

We can define creation and annihilation operators for a 
fermion of species s at :e 



as{x) = tjjsix) , al{x) = 



d 



They obey the anti-commutation relations 

{a,{x),as'{x')} = {al{x) , al,{x')} =0, 
{al{x),as'{x')} = 5ss'5{x-x'). 



(179) 



(180) 



For formal precision, we may imagine here a discrete space 
lattice, X = ne,n = (ni, n2, na), rife e Z, such that 6{x — 
x') = „/ ,n'^ , ^n3,n'^ ■ The particle number for a species 
s at position x reads 



^fs{x) 



d 



dtpsix) 



4>s{x) = al{x)as{x). (181) 



The eigenvalues oiAfs{x) are one or zero, with (^Afs{x)) = 
J\fs{x). All particle numbers commute 

[Afs{x),Afs>{x')]^0. (182) 

For later purposes we define the operators 

Mst{x,y) = al{x)at{y), (183) 

with J\fs{x) = A4ss{x,x), and commutation relations 

[Mst{x,y),Ms't'{x',y')] = (184) 
Ss'tS{x' - y)Mst'{x,y') - 6st'S{x - y')Ms't{x' ,y). 

One can change to a momentum basis by 



,{x) = [ e^P^Mp), (185) 
Jv 



such that the annihilation and creation operators for a 
fermion with momentum p become 

as{p) = [ e-'P-a,{x) , al{p) = f e'^^alix), (186) 

J X J X 

with commutation relation 

{al{p),as'ip')} = dss'd{p-p'). (187) 

(Appropriate factors 27r are included in = d^p/{2-K)^ 
and 5{p-p') = {2tt)^6^{p-p'). Alternatively, both and 
S{p) can be formulated for discrete momenta on a torus.) 
The particle number for momentum p 



K{p) = al{p)as{p) 



Jx,y 



V) 



Mss{x,y) (188) 



obeys 

[Kip),K'{p')]=0, (189) 
but it does not commute with J\fs{x), 

[N-sip),K'ix)] ^6ss' ([e-'^'^^-y^MUy.x) 

-e'P^-v)Mss{x,y)\. (190) 
The total particle number obeys 

N = Y, [m{x) = ^ [Af{p) (191) 

and commutes with both Af{x) and Af{p) . 

We can define the position and momentum operators for 
a multi-fermion system with (A'') > as 

Xk = {N)-'J2 I ^kK{x), 

Pfe = (A^)-^^ / PkKip) 

where das{x) / dxj~ can be defined by the diff'erence of op- 
erators at two neighboring points. They obey the commu- 
tation relation 



. N 



5ki- 



(193) 



For eigenstates of A^ the commutator vanishes ^ 1/A^ for 
N ^ oo. At this point we do not question about the basic 
origin of non-commuting operators and their association to 
observables. This issue concerns the all-time probability 
distribution which we do not address in this section. We 
simply note that all these operators act in a well defined 
way on any given Grassmann wave function g{t). 

In particular, wc may consider one-particle states. They 
correspond to eigenstates of total particle number with 
eigenvalue one, 



Ngit) = git). 



(194) 



The commutator [Afc,P/] = iSki takes now its standard 
form. It is straightforward to construct the eigenstates 
with A^ = 1. For S species fermions (s = 1 ... 5) and L 
lattice points x,F = LS, all contributions to g must have 
precisely F — 1 factors of tps (x) . We may denote by i/jg (x) 
the Grassmann elements to which ips{x) is conjugate, 

Jvi;i;t{y)Mx) = {-lf-^Jvi,Mx)My) 

= SstSix-y), (195) 

which obey 



J^s{y)i't{x) = SstS{x-y)i>t{x), 
N^t{x) = Mx). 



(196) 
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They can be obtained from the vacuum state 

\o) = ■ ■ ■M^i)M^2) . ■ . (197) 

(with some fixed ordering of the lattice points Xi) by 



d 



|0). 



(198) 



It is convenient to use here a different "fermion conven- 
tion" for the choice of signs of the basis elements of the 
Grassmann algebra. We will take a plus sign for all or- 
dered products of Tps (x) — Tpa for which the lower a are to 
the right. 

The most general eigenstate with iV = 1 then reads 
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(199) 



The coefficients of the states with fixed Afg (x) = 1 can be 
associated with the components of a complex wave function 
ips{x). It is normalized according to 



(200) 



/ 'f*six)'Ps{x) = 1, 
such that the "one- hole state" , 

hi=Y I ip*{x)ips{x), 

is conjugate to gi, i.e. hi — gi, or 

Viphigi = 1. (201) 



For an arbitrary Grassmann operator A acting on gi we 
may define an associated operator A acting on ips (x) by 

•^91 ^Y. I Ast{x,y)vt{y)^s{x). (202) 

This guarantees 

{A) = / Vi;~giAgi 



Y / '^*r{z)Ast{x,y)(pt{y)A{z)^s{x) 



Y / '^*si^)^st{x,y)ipt{y), 

s,t •''^'V 



(203) 



which amounts to the standard quantum rule for expecta- 
tion values in terms of associated operators. In particu- 
lar, the position and momentum operators (|192p are rep- 
resented as 

Xk = 5{x - y)yk , Pk = -iS{x -y)^. (204) 

<jyk 

A simple proof uses the identities 

il^s{.x)My) - 5st6{x - j/)|0) , PVIO) = 1. (205) 



For one species {S = 1) we recover the quantum particle of 
the preceding section. 

As a particular example for an evolution equation we 
may consider a system of electrons and positrons in an 
external electromagnetic field. The Hamiltonian reads 
(5^4) 



{m((pV + X^X) - eAo(¥'V - X^X) 

+i{ip^TkdkX* + X^TkdkV) 
+eAk{ip'^ TkX* +X^'rkip]. 



(206) 



Here (p and x are two component Grassmann variables for 
quasi-electrons and quasi-positrons, respectively 



Lpi{x) 
(P2{x) 



, X 



Xi{x) 

X2{x) 



(207) 



while ip*,x* involve Grassmann-derivatives ((^^ — (93*)"^) 

, _ / d/dipi{x) \ * _ f d/dxi{x) \ , . 

-[d/d^x) ) ' ^ -[d/dx2{x))- ^20^) 

The space-derivative of a Grassmann-derivative is defined 

as 



dkXl 



lini — 



d 



d 



dxi{x- 



dxi{x) 
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and A^ = (Aq, Ak) denotes the electromagnetic potentials. 
The coinvariance of the corresponding evolution equation 
under Lorentz and gauge transformations is not easily visi- 
ble in this formulation - the proof becomes straightforward 
for the associated Grassmann functional integral which is 
discussed in ref. [2l|. The last terms in eq. (|186p anni- 
hilate or create simultaneously one quasi-electron and one 
quasi-positron. The total number of quasi-electrons plus 
quasi-positrons is not conserved. As is well known, there 
exists a modified total particle number which is conserved 
in the absence of electromagnetic fields. The Hamiltonian 
formulation of the time evolution can become rather cum- 
bersome for strong fields where particle production plays 
a role. Many aspects are simpler in a functional integral 
formulation. 



XV. CONCLUSIONS 

Time can arise as a probabilistic structure for a suitable 
classical statistical ensemble. Since the concept of time 
is not introduced a priori, a probability distribution {pu>} 
characterizing such an example cannot refer to any par- 
ticular moment in time. It rather describes the complete 
possible probabilistic information about reality, covering 
the past, present and future, as well as possible situations 
for which time is only defined approximately and may loose 
its meaning under certain circumstances. 

A time structure for {p^} requires an ordering structure 
for observables which permits to introduce the concept of 
local probabilities (local in time) by integrating out the 
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past and future. The sequence of local-time probabilities 
{Prit)} for different t can be viewed as characterizing a 
state of a subsystem of the larger system described by the 
all-time probability distribution {puj}, namely the local- 
time subsystem. While the all-time probability distribu- 
tion {pu} contains, in principle, the information about all 
times, the local probability distribution {prit)} refers to 
a particular time. The expectation values of local observ- 
ables A{t) can be computed from {prit)} without requir- 
ing information concerning the past and future. If only 
the information about the local probability distribution is 
available or needed we deal with incomplete statistics ^12] 
for the local-time subsystem. 

Predictivity for the expectation values of observables 
at future times in terms of the expectation values at the 
present time requires the existence of an evolution law. 
Typically, such an evolution law is a differential equation 
for the time evolution of the local probabilities Prit)- For 
a given all-time probability distribution {puj} the evolution 
law can be computed. We have presented explicit examples 
for all-time probability distributions {p^} which result in 
an oscillating evolution of the local-time probability dis- 
tribution {pr{ty}. We have also shown that an arbitrary 
time evolution of {Pt(0} can be implemented by a suit- 
able {pcj}- Typically, many different {pu:} can lead to the 
same time sequence of local probabilities {prit)}. Such 
time sequences therefore define equivalence classes for all- 
time probabilities {pu}- The expectation values of the local 
observables as well as suitable correlations depend only on 
the equivalence class, such that the full information con- 
tained in a particular all-time probability distribution {pu} 
is not needed. 

We concentrate on a suitable unitary evolution law which 
describes rotations of a real unit vector qrit). The local 
probabilities are related to the "wave function" qrit) by 
Prii) — <It{^)^ such that positivity, Prit) > 0, and nor- 
malization, '^T-Prit) = 1, are easily implemented. The 
sign ST-{t) of Qrit) — ST-{t)y^pT-{t) is largely fixed by the 
analytic properties of continuous rotations as well as possi- 
ble analytic properties with respect to a continuous index 
r. Remaining ambiguities in the choice of {srit)} without 
physical relevance can be considered as a choice of gauge. 
The wave function {qrit)} at a given time t, together with 
the evolution law contains essentially the same informa- 
tion as the sequence of local-time probabilities {pr{t)}. In 
particular, the sequence {prit)} contains (partial) informa- 
tion about the distribution of sign functions {sT-{t)} which 
is needed for the computation of expectation values of off- 
diagonal observables. We consider the signs as a genuine 
part of the local-time information and take the wave func- 
tion {qrit)} rather than the probabilities {prit)} as the 
basic ingredient for the description of physics at a given 
time t. For any given classical statistical ensemble {puj} 
the wave function {qrit)} is, in principle, computable (up 
to irrelevant gauge choices). From {qrit)} one can compute 
directly the expectation values of observables A{t + a) at 
times different from t. These observables are associated to 
off-diagonal operators, similar to the Heisenberg picture in 
quantum mechanics. 



The unitary evolution law constitutes a direct implemen- 
tation of the quantum mechanical wave function in terms of 
the classical probability density {pui}. The quantum wave 
function at a given time t is directly related to the sequence 
of local probability distributions {prit)}. The components 
of a real quantum wave function are simply given by qrit). 
This is no limitation since a complex wave function -ip^' {t) 
can always be written as a real wave function with twice 
the number of components. The unitary evolution equa- 
tion for the local probabilities becomes the Schrodinger 
equation for the wave function in quantum physics. All 
quantum rules for the computation of expectation values 
of observables apply. Quantum physics results from classi- 
cal probabilities which admit a time structure with unitary 
evolution law. 

A further fundamental concept of quantum physics and 
quantum field theory arises naturally from our formulation 
of probabilistic time, namely fermions and Grassmann vari- 
ables. We have labeled the states u of the classical ensem- 
ble by the eigenvalues of two-level observables which corre- 
spond to yes/no questions. Correspondingly, the states r 
for the local-time subsystem (with associated probabilities 
Prit)) can be labeled by the eigenvalues of local number 
operators Na (t) which can take the values zero or one. As- 
sociating Na (t) = 1 with a fermion of type a to be present 
at the given time t, and Na{t) = with no fermion present, 
the states t are basis states for the quantum wave function 
{qrit)} for a multi-fermion system in the occupation num- 
ber basis. 

We have presented various explicit examples for a quan- 
tum time evolution arising from a unitary evolution law for 
a classical statistical ensemble. This covers two-state quan- 
tum mechanics as well as quantum particles in a potential. 
Furthermore, we also have discussed quantum field theo- 
ries for fermions. The expectation values of all diagonal 
observables can be computed from the local-time probabil- 
ities Prit) in the standard way for classical observables, i.e. 
by expressions linear in Prit). The off-diagonal observables 
involve, in addition, the signs Sr (t) and depend on pr (t) in 
a non-linear manner. 

Open issues remain. We have shown by explicit construc- 
tion that all-time probabilities {pu}} exist for any given se- 
quence of local probabilities {prit)}, and that many differ- 
ent {pu} lead to the same sequence. However, we have not 
yet searched for interesting realizations of all-time proba- 
bilities that would shed light on the emergence of particular 
evolution laws. In this context, we also have not yet found 
a simple direct construction of the wave function {q^-it)} 
in terms of {pi^}, which would supplement the simple rule 
pi)) for the local probabilities Prit). 

The non-uniqueness of the all-time probability distribu- 
tion for a given time sequence of probabilities {prit)} or 
wave function {qrit)} raises another important question. 
One may ask which unequal time correlation functions 
can be defined in terms of the sequence of wave functions 
{qrit)}- Such correlation functions differ from the classical 
correlation function. They are rather robust with respect 
to the details of {pcj} since they only depend on equivalence 
classes of time sequences {qrit)}- Furthermore, they only 
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depend on equivalence classes of observables and are there- 
fore robust with respect to the precise choice of observables 
as well. This robustness suggests that actual measurement 
correlations are based on conditional probabilities related 
to such correlations. Correlations of this type have been 
discussed in a euclidean setting in ref. [ill and found to 
be related to non-commutative operator structures. The 
present paper advances these ideas since a unitary time 
evolution is implemented directly in terms of p robabilities, 
and not only by analytic continuation as in |12| . We discuss 
the appropriate correlation functions explicitly for simple 
systems. They are associated to quantum operators and 
the associated operator product. 

Finally, one may wonder about the origin of the om- 
nipresence of non-commuting observables in quantum 
physics. We have shown previously [isj how non- 
commuting observables can arise from standard classical 
observables for a classical statistical ensemble that com- 
prises a subsystem and its environment. The system ob- 
servables correspond to equivalence classes which reflect 
identical properties of the system, but different properties 
of the environment. The lack of commutativity reflects that 
the joint classical probabilities are not properties of the 
equivalence classes, as characteristic for incomplete statis- 
tics. The non-commutativity is related to a new product 
structure involving the equivalence classes. It is different 
from the commuting classical product. This new prod- 
uct defines correlations different from the classical corre- 
lations. While the general principle for the emergence of 
non-commuting observables is known and some examples 
for non-commuting operators have been discussed in this 
paper, the present work has made no systematic attempt 
to construct non-commuting observables from underlying 
classical observables. Only for some simple cases we have 
related to non-commuting quantum operators directly to 
classical observables in the all-time ensemble. 

Despite these open points we find it remarkable in which 
simple way the basic concepts of quantum physics, as 
the unitary time evolution of probability amplitudes, non- 
commuting operators or fermions, arise from the formula- 
tion of the concept of time in a general probabilistic setting 
of classical statistics. 



APPENDIX: EVOLUTION OPERATOR AND 
QUANTUM FORMALISM 

1. Grassmann evolution operator 

In the Grassmann representation the close correspon- 
dence of the "unitary evolution law" given by eqs. ((53|). 
(j58p or (|57p to the time evolution in quantum mechan- 
ics can be made apparent. For this purpose we em- 
ploy operators U(t,tQ) acting in the Grassmann algebra. 
They describe the time evolution of a Grassmann element 
g{t) = Y.r'^'r{t)gT according to 

g{t) = U{t, to)g{to) , m - li^{t, toMh). (210) 

Here we define U'^{t,tQ) such that for arbitrary ele- 



ments g, f of the Grassmann algebra one has 

J Vi^U^gf = J VibgUf. (211) 

The associated time evolution of pr (t) obtains for a given 
evolution operator U{t,to) as 

Prit) = j V^g{t)V,g{t) 

= J Vi;U^{t,to)gito)Vrll{t,to)g{to). (212) 

We may express the linear operators U,U^ as matrix 
multiplications 

ll{t,to)g{to) = ^Urp{t,to)Cp{to)gr, 

U^it, toVgito) = E ^rp{t, to)c;{torgr, (213) 

with 

Urp = Upr, (214) 

such that 

Prit) ^ J V^J^ UlpC*pUraC„9X9T 
A.p.cj 

= ^C*p{to)Upr{tM)Ura{tM)Ca{tQ). (215) 
p,a 

From X^r^T ~ 1 infer 

Y,{UU)p„c;c„ ^ 1, (216) 

which can hold for arbitrary Cr obeying |crp = 1 only 
if {UU)pa — 5p„. Thus the matrix U is the inverse of U . 
Furthermore, the condition that Prit) is real is obeyed by 
eq. ((2T5)) if U'^ = U*. Then p^(t) > follows automati- 
cally 

pAt)^\Y,UrAt,to)Ca{to)\^. (217) 

(7 

We conclude that [/ is a unitary matrix, UW = I, U = 
W. Already at this stage we recognize the unitary time 
evolution operator of quantum mechanics. The Grassmann 
operator U plays a role similar to the unitary evolution 
operator in quantum mechanics, 

U{t,to)U{t,t„) = lg. (218) 

For the case of real Cr — Qt we have to require that 
qr = J2p UrpQp is real. This restricts the unitary trans- 
formations to the subgroup of real matrices U. Those are 
the rotations SO{Af), or [/ = R,R^R = 1, and we re- 
cover eq. ([57| if we consider the gr as a fixed basis of the 
Grassmann algebra. 
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We note that the Grassmann algebra admits an involu- 
tion 



Here we have introduced the quantum mechanical "density 
matrix" 



^ Qr , Cr ^ , g ^ g. 



(219) 



This can be used in order to define the notion of a complex 
conjugation in the Grassmann algebra 



(220) 



which is a nontrivial operation even for real = It- In 
terms of this complex structure we can regard lA as the 
hermitean conjugate oilA. 

2. Schrodinger equation 

The analogy to quantum mechanics is apparent if we 
interpret the Grassmann- valued wave function 5 as a vector 
with components 



ipr{t) = Vrg{t) = Cr{t)gr. 



(221) 



The complex conjugate wave function g* — g has compo- 
nents 

Mt) = 'Prm - <{t)9r - Kit). (222) 

In terms of the components we can write 

Pr{t) = I V^^r{tWr{t). (223) 



Expectation values of observables find an expression similar 
to quantum mechanics, 

(Ait)) = 1 2?V^^^r(i)-4¥'r(0, (224) 

with A acting as a diagonal operator A<^t — A^^t- 

In this picture the time evolution can be written as the 
usual unitary evolution of the wave function 

'Vrit) = ^Urp{t,ta)(Pp{tQ), 
P 

cprit) = Y.^;p{t,to)Vpito) , U^U^l, (225) 
p 

which yields explicitly 

Pr{t) = J V^^r{t)Vr{t) 

p,a •' 

= J2Kp{t,to)Ura{t,to)pap{to) 
p,a 

= {U{t,to)p{to)U\t,to))^^^{p{t))^^. (226) 



Prcr = J V^ifrfa , P ^ p'' , tr/9 = 1, (227) 

with real diagonal elements prr = Pt obeying < prr < 1- 

The quantum mechanical formalism can also be pre- 
sented in the standard way by omitting the basis vectors gr 
and defining complex vectors ip and if* with components 
Cr,c*, with associated density matrix Pt-o- — CrC*^- The 
classical observables can be represented as diagonal opera- 
tors A, {A)ra — ArSrp, and we can define the usual scalar 
product (with \(p) = ip, {ip\ = cp*, such that 



{A) = {^\A\K = tr(pi). 



(228) 



Even though we describe a classical statistical ensemble, 
the unitary time evolution ((53|) , ([57|) can be fully described 
with the formalism of quantum mechanics. Of course, at 
this stage the observables all commute and are all repre- 
sented by diagonal operators. Furthermore, the wave func- 
tion Lp remains real if we use the real Grassmann algebra 

with Ct- = C* = Qr- 

The time evolution of the Grassmann wave function g{t) 
obeys a differential evolution law 



idtg{t) = H{t)g{t) 



(229) 



with Hamilton operator H defined by eq. (|93p . In terms 
of the components of the wave function eq. (|229p becomes 



P 

ldtCr{t) = ^i/,p(<)Cp(t). 



(230) 



This shows explicitly that the Schrodinger equation can 
be obtained from an appropriate evolution law for classical 
probabilities. 
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